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Preface 


Elementary particle physics has made remarkable progress in the past ten 
years. We now have, for the first time, a comprehensive theory of particle 
interactions. One can argue that it gives a complete and correct description of 
all non-gravitational physics. This theory is based on the principle of gauge 
symmetry. Strong, weak, and electromagnetic interactions are all gauge 
interactions. The importance of a knowledge of gauge theory to anyone 
interested in modern high energy physics can scarcely be overstated. 
Regardless of the ultimate correctness of every detail of this theory, it is the 
framework within which new theoretical and experimental advances will be 
interpreted in the foreseeable future. 

The aim of this book is to provide student and researcher with a practical 
introduction to some of the principal ideas in gauge theories and their 
applications to elementary particle physics. Wherever. possible we avoid 
intricate mathematical proofs and rely on heuristic arguments and illustrative 
examples. We have also taken particular care to include in the derivations 
intermediate steps which are usually omitted in more specialized communi- 
cations. Some well-known results are derived anew, in a way more accessible 
to a non-expert. 

The book is not intended as an exhaustive survey. However, it should 
adequately provide the general background necessary for a serious student 
who wishes to specialize in the field of elementary particle theory. We also 
hope that experimental physicists with interest in some general aspects of 
gauge theory will find parts of the book useful. 

The material is based primarily on a set of notes for the graduate courses 
taught by one of us (L.F.L.) over the past six years at the Carnegie—Mellon 
University and on lectures delivered at the 1981 Hefei (China) Summer 
School on Particle Physics (Li 1981). It is augmented by material covered in 
seminars given by the other author (T.P.C.) at the University of Minnesota 
and elsewhere. These notes have been considerably amplified, reorganized, 
and their scope expanded. In this text we shall assume that the reader has had 
some exposure to quantum field theory. She or he should also be moderately 
familiar with the phenomenology of high energy physics. In practical terms 
we have in mind as a typical reader an advanced graduate student in 
theoretical physics; it is also our hope that some researchers will use the book 
as a convenient guide to topics that they wish to look up. 

Modern gauge theory may be described as being a ‘radically conservative 
theory’ in the sense used by J. A. Wheeler (see Wilczek 19825). Thus, one 
extrapolates a few fundamental principles as far as one can, accepting some 
‘paradoxes’ that fall short of contradiction. Here we take as axioms the 
principles of locality, causality, and renormalizability. We discover that a 
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certain class of relativistic quantum field theory, i.e. the gauge theory, 
contains unexpected richness (Higgs phenomena, asymptotic freedom, 
confinement, anomalies, etc.), which is necessary for an understanding of 
elementary particle interactions. And yet, this does not occasion any revision 
of the basic principles of relativity and quantum mechanics. Thus the 
prerequisite for the study of gauge theory is just the traditional preparation in 
advanced quantum mechanics and quantum field theory, especially the 
prototype gauge theory of quantum electrodynamics (QED). 

The book is organized in two parts. Part I contains material that can be 
characterized as being ‘pre-gauge theory’. In Chapters 1, 2, and 3 the basics 
of relativisitic quantum field theory (quantization and renormalization) are 
reviewed, using the simple 1¢* theory as an illustrative example. In Chapters 
4 and 5 we present the elements of group theory, the quark model, and chiral 
symmetry. The interrelationship of the above main topics—renormalization 
and symmetry—is then studied in Chapter 6. The argument that quarks are 
the basic constituents of hadrons is further strengthened by the discovery of 
Bjorken scaling. Scaling and the quark—parton model are described in 
Chapter 7. These results paved the way for the great synthesis of particle 
interaction theories in the framework of the non-Abelian gauge theories, 
which is treated in Part II. After the classical and quantized versions of gauge 
theories are discussed in Chapters 8 and 9, we are then ready for the core 
chapters of this book—Chapters 10-14—-where gauge theories of quantum 
chromodynamics (QCD), quantum flavourdynamics (QFD), and grand 
unification (GUT) are presented. As a further illustration of the richness of 
the gauge theory structure we exhibit its nonperturbative solutions in the 
form of magnetic monopoles and instantons in Chapters 15 and 16. 

We have also included at the end of the book two appendices. In Appendix 
A one can find the conventions and normalizations used in this book. 
Appendix B contains a practical guide to the derivation of Feynman rules as 
well as a summary of the propagators and vertices for the most commonly 
used theories—the 16*, Yukawa, QCD, and the (R; gauge) standard model 
of the electroweak interaction. 

In the table of contents we have marked sections and chapters to indicate 
whether they are an essential part (unmarked), or details that may be omitted 
upon a first reading (marked by an asterisk), or introductions to advanced 
topics that are somewhat outside the book’s main line of development 
(marked by a dagger). From our experience the material covered in the 
unmarked sections is sufficient for a one-semester course on the gauge theory 
of particle physics. Without omitting the marked sections, the book as a 
whole is adequate for a two-semester course. It should also be pointed out 
that although we have organized the sections according to their logical 
interconnection there is no need (it is in fact unproductive!) for the reader to 
strictly follow the order of our presentation. For example, §1.2 on path 
integral quantization can be postponed until Chapter 9 where it will be used 
for the first time when we quantize the gauge theories. As we anticipate a 
readership of rather diverse background and interests, we urge each reader to 
study the table of contents carefully before launching into a study pro- 
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gramme. A certain amount of repetition is deliberately built into the book so 
that the reader can pick and choose different sections without any serious 
problems. An experimentally inclined reader, who is not particularly 
interested in the formal aspects of relativistic quantum field theory, can skip 
Chapters 1, 2, 3, and 6 on quantization and renormalization. After an 
introductory study of group theory and the quark model in Chapters 4, 5, 
and 7 she or he should proceed directly to the parts of Chapters 8, 10, 11, 12, 
14, etc. where a general introduction to and applications of gauge theory 
can be found. 

The sections on references and bibliography at the end of the book 
represent some of the commonly cited references that we ourselves are 
familiar with. They are not a comprehensive listing. We apologize to our 
colleagues who have been inadequately referenced. Our hope is that we have 
provided a sufficient set so that an interested reader can use it to go on to find 
further reviews and research articles. 

It is a pleasure to acknowledge the aid we have received from our 
colleagues and students; many have made helpful comments about the 
preliminary version of the book. We are very grateful to Professor Mahiko 
Suzuki who undertook a critical reading of the manuscript, and also to 
Professors James Bjorken, Sidney Drell, Jonathan Rosner, and Lincoln 
Wolfenstein for having encouraged us to begin the conversion of the lecture 
notes into a book. One of us (T.P.C.), would like to thank the National 
Science Foundation, UMSL Summer Research Fellowship Committee, and 
the Weldon Spring Endowment for support. During various stages of 
working on this project he has enjoyed the hospitality of the theoretical 
physics groups at the Lawrence Berkeley Laboratory, the Stanford Linear 
Accelerator Center and the University of Minnesota. L.F.L. would like to 
thank the Institute for Theoretical Physics at the University of California— 
Santa Barbara for hospitality and the Department of Energy and the Alfred 
P. Sloan Foundation for support. Finally, we also gratefully acknowledge the 
encouragement and help given by our wives throughout this project. And, we 
are much indebted to Ms Susan Swyers for the painstaking task of typing this 
manuscript. Other technical assistance by Ms Tina Ramey and Mr Jerry 
McClure is also much appreciated. 


Note added in proof. As this manuscript was being readied for publication we 
received the news that the CERN UA1 and UA2 groups have observed events 
in pp collisions which may be interpreted as the production of an 
intermediate vector boson W with a mass approximately 80 GeV. Also, the 
Irvine~Michigan—Brookhaven collaboration reported a preliminary result 
setting a lower bound for the lifetime t(p > e* 2°) > 6.5 x 10°! years. 


St. Louis and Pittsburgh LPC, 
September 1982 L.F.L. 
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Part I 


Basics in field quantization 


The dynamics of a classical field @(x) are determined by the Lagrangian 
density Z (p, 0,@) through the action principle 


ôS = 0 (1.1) 


where S is the action 
S = [szo 0,0). 


This extremization leads to the Euler-Lagrange equation of motion 


Oe, eee 
"5@,6) ò$ 


To quantize a system we can adopt either of two equivalent approaches. 
The canonical formalism involves the identification of the true dynamical 
variables of the system. They are taken to be operators and are postulated to 
satisfy the canonical commutation relations. The Hamiltonian of the system 
is constructed and used to find the time evolution of the system. This allows 
us to compute the transition amplitude from the state at an initial time to the 
state at final time. Alternatively, we can use the Feynman path-integral 
formalism to describe the quantum system. Here the transition amplitude is 
expressed directly as the sum (a functional integral) over all possible paths 
between the initial and final states, weighted by the exponential of i times the 
action (in units of the Planck’s constant A) for the particular path. Thus in the 
classical limit (h > 0) the integrand oscillates greatly, making a negligible 
contribution to the integral except along the stationary path selected by the 
action principle of eqn (1.1). 

In this chapter we present an elementary study of field quantization. First 
we review the more familiar canonical quantization procedure and its 
perturbative solutions in the form of Feynman rules. Since we will find that 
gauge field theories are most easily quantized using the path-integral 
formalism we will present an introduction to this technique (and its 
connection to Feynman rules) in §1.2. For the most part the simplest case of 
the self-interacting scalar particle will be used as the illustrative example; 
path-integral formalism for fermions will be presented in §1.3. 

Since the path-integral formalism will not be used until Chapter 9 when we 
quantize the gauge fields, the reader may wish to postpone the study of §§1.2 
and 1.3 until then. It should also be pointed out that even for gauge theories 
we shall use these two quantization formalisms in an intermixed fashion. By 
this we mean that we will use whatever language is most convenient for the 
task at hand, regardless of whether it implies path-integral or canonical 


= 0. (1.2) 
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quantization. For example, in the discussion of the short-distance pheno- 
mena in Chapter 10, we continue to use the language of ‘operator product 
expansion’ even though strictly speaking this implies canonical quantization. 
The reader 1s also referred to Appendix B at the end of the book where one 
can find a practical guide to derivation of Feynman rules via path-integral 
formalism. 


1.1 Review of canonical quantization formalism 


We assume familiarity with the transition from a classical nonrelativistic 
particle system to the corresponding quantum system. The Schrödinger 
equation is obtained after we replace the canonical variables by operators 
and the Poisson brackets by commutators. These operators act on the 
Hilbert space of square integrable functions (the wavefunctions), and they 
satisfy equations of motion which are formally identical to the classical 
equations of motion. 

A relativistic field may be quantized by a similar procedure. For a system 
described by the Lagrangian density (@, 0,0), the field @(x) satisfies the 
classical equation of motion given in eqn (1.2). We obtain the corresponding 
quantum system by imposing the canonical commutation relations at equal 
time 


[ (x, ©, O(x’, D] = —i67°(x — x’) 
[ n(x, t), n(x’, t)] = Lox, t), p(x’, t)] =0 (1.3) 


where the conjugate momentum is defined by 


ÔL 
n(x) = TR (1.4) 
The Hamiltonian 
H = | stacy doP(x) — L(x)] (1.5) 
governs the dynamics of the system 
Oo (x, t) T iL H, P(x, t) | 
Oon(x, t) = iL, x(x, À]. (1.6) 


Example 1.1. Free scalar field. Given the Lagrangian density 
L = 31(0,6)(0*¢) — uo], 
eqn (1.2) yields the Klein—Gordon equation 
(0° + p*)p(x) = 0. (1.7) 


In quantum field theory the field ¢(x) and its conjugate momentum operators 
given by eqn (1.4), 2(x) = 0o¢(x), satisfy the canonical commutation 
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relations 
[AoG(x, 1), ox’, NJ = —id°*(x — x’) 
LOoP(x, t), Q(x’, t)] = [p(x, t), p(x’, t) | = 0. (1.8) 


The Hamiltonian is given by 


Ho = DET + (VE) + u’ o°]. (1.9) 


The time evolution equation (1.6), which is basically Hamilton’s equation of 
motion, can be cast in the form of (1.7). Thus the field operator (x) formally 
satisfies the Klein-Gordon equation. This simple non-interacting case can be 
solved and we have 


d°k i(k-x — — i(k:x — @,t 
p(x, t) = |a [a(k) ef *-) 4 a(k) e70] (1.10) 


where œ, = (k? + u?)*?. The coefficients of expansion a(k) and a(k) are 
operators. The canonical commutation relations of eqn (1.8) are transcribed 
into 

[a(k), a'(k’)] = 6°(k — k’) 


[a(k), a(k’)] = [a'(k), a(k’)] = 0 (1.11) 


and the Hamiltonian of eqn (1.9) can be expressed as 
H = | d3kw,a'(k)a(k) (1.12) 


where we have discarded an irrelevant constant. Remembering the situation 
of the harmonic oscillator, we see immediately that a(k) and a'(k) can be 
interpreted as destruction and creation operators. Thus the one-particle state 
with momentum k is given by the creation operator acting on the vacuum 
state 


Ik> = [(27)*2e,]"/7a"(k)|0> (1.13) 
where the normalization 1s 
<k'|Ik> = (272)°2m, (k — K’). 


The product ata has the usual interpretation as a number operator and eqn 
(1.12) shows that H> is the Hamiltonian for a system of non-interacting 
particles. 

Given the solution, (1.10), and (1.11), we can easily calculate the Feynman 
propagator function, which is the vacuum expectation value for a time- 
ordered product of two fields, 


iA(x, — x2) = COlT(P(%1)O(%2))10> 
= Ot, — t2)<OlO™%1)O(%2 OD + A(t. — 11) O1O(x2)O(%1)|0> 


d*k i i i 
= enaa E a SKa) (1.14) 
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Example 1.2. Scalar field with 1¢* interaction. The Lagrangian density is 
given by 


Y = AOAN — 261 - Fo 
the equation of motion is 
(2? + P = — 4 O: (1.15) 
The conjugate momentum and canonical commutation relations are the 


same as those for the free-field case in Example 1.1. The Hamiltonian is of the 
form 


H = HM + H' (1.16) 
where H is given by eqn (1.9) and 
H' = [er 
where 
, À 4 
KH =a? (1.17) 


is the interaction Hamiltonian density. Since the free-field theory is soluble 
we can obtain transition amplitudes and matrix elements of physical interest 
by a systematic expansion in 4. This approximation scheme of perturbation 
theory will be briefly outlined below. 

In the usual Heisenberg picture the operators are time-dependent and the 
time evolutions of the dynamical variables of the system are governed by the 
Hamiltonian 


(x, t) z el (x, 0) e` iHi 
n(x, t) = e”'n(x, 0) e7”. 


The state vector |a> is time-independent. But, in the Schrödinger picture, the 
operators are time-independent and state vectors carry time dependence. 
They are related to those in the Heisenberg picture by 


P(x) = eo, 1) e 
n(x) = e "n(x, t) e” 
la, tS =e "lay. 


In perturbation theory we introduce another picture—the interaction 
picture—with operators and states defined by 


= eitot e "(x f) eit! e` Hot 


= U(t, 0)6(x, NUT (t, 0) (1.18) 
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Similarly, for z'(x, ®, 


la, ©! = "tja, YS 
= U(t, Ojay (1.19) 
where 
U(t, 0) = e7% eo (1.20) 


is the unitary time-evolution operator. Since the operators in the interaction 
picture (1.18) satisfy the (soluble) free-field equation 


dod’ (x, t) = iLHo, Ox, 1] 

Oon'(x, t) = iL, n(x, À], (1.21) 
the dynamic problem in this language becomes that of finding the solution 
for the U-matrix. 


The time-evolution operator U can be defined more generally than in 
(1.20), 


la, tX = U(t, to)la, to >’ (1.22) 
where U(to, to) = 1 and satisfies the multiplication rule, 
U(t, UH, to) = U(t, to) 
U(t, O)U~ (to, 0) = U(t, to). (1.23) 
The equation of motion for the U-operator can be deduced from eqns (1.19) 
and (1.20) 
_ 0 
in U(t, to) = H" (UC, to) (1.24) 
where 
H" a e'o H' e` ot 
is the interaction Hamiltonian in the interaction picture, i.e. 
H" = H'(¢'). (1.25) 
Eqn (1.24) has the solution 


t 


U (t, to) = Texp| —i fanaa) 


to 
f 


= Tex] i [an [dreo 1) (1.26) 


to 


which can be expanded in a power series 
t t 


© (=i) 4 4 
U(t, to) = 1 + d*x, Ge oma 
t to to 


x | d*x,T( "(x )H'(X2) «+» #"x,)). (1.27) 


to 
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Green’s functions in 1¢* theory. Next we need to translate this formal 
perturbative solution into quantities that will have more direct physical 
meaning. 

In field theory we are interested in calculating Green’s function defined by 


Gxi,- Xn) = OTO: ), ---, Pn) (1.28) 


from which the S-matrix elements can be readily obtained. By a straightfor- 
ward applicatión of eqns (1.18), (1.23), and (1.26) we have 


G(x, ..., Xa) = COUT (t, OTCA x) ... b'(x,) 


x exp| —i | avn) pur 0)|0> (1.29) 
—t 
where f is some reference time which we shall eventually let approach oo. In 
this limit the vacuum state becomes an eigenstate of the U-operator, and the 
eigenvalue product of the two Us in (1.29) becomes 


l 


coir (exp| =i | i avn") |)o> 


The effect of these two Us acting on the vacuum states is to take out ‘the 
disconnected part’ of the vacuum expectation value (see, for example, 
Bjorken and Drell 1965). Also, after we substitute the power series expansion 
of (1.27), the n-point Green’s function with the notation of (1.25) takes on the 
form 


(1.30) 


oO 


dyi, -> dyp COTE), ---, On) 


x H'($'(V1)) -HPOO (1.31) 


The subscript c denotes ‘the connected part’. The terminology clearly reflects 
features in the graphic representation of the Green’s function. 

Consider the simplest example of a first-order (p = 1) term for a four- 
point (n = 4) Green’s function in the theory with X '(p) = 1/4! * as in (1.17) 


1A 
GM) ta) = | ALOT), -s PEPO. (1.32 


We then normal-order the entire time-ordered product by moving the 
creation operators to the left of the annihilation operators and eliminating 
those terms which end up with the annihilation operator on the right and/or 
the creation operator on the left. After this application of Wick’s theorem 
(1950) the connected part of the expression in (1.32) decomposes into a 
product of two-point functions 


GP(X1, -- +5 X4) = (— iå) [ason ($'(x1)6'())10><O| T(P'(x2)$'())10 
CO|T($'(3)b'())10<0|T(P'(x4)P')10>. (1.33) 
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The original 4! factor in the denominator of (1.32) is cancelled because there. 
are a corresponding number of ways to contract the ¢'(x;)s with each field in 
[¢'\(y)]*. The interaction picture field obeying the free-field equation, the 
propagator iA(x, y) = <0|7(¢'(x)¢'())|0), is a known quantity and is given 
by eqn (1.14). A graphic representation of the expression in (1.33) is shown in 
Fig. 1.1. 


xy X2 


Fic. 1.1. Graphic representation of eqn (1.33). 


We next consider the example of the second-order term for the four-point 
function, i.e. the p = 2, n = 4 term in eqn (1.31) 


a 
GPx, 1X4) = a1 (=) arn d*y,<0|T(¢'(x1) e. p(x Epy) 
x [6'(72)])10>. (1.34) 


We then use Wick’s theorem to reduce it and keep only the connected parts, 
G(x, ... X4) 


l ; ; 
= z1 (—id) fay d*y,[iAQ,, yo)? EAC, yı) A(x2, yi) ] 
x [A(x3, y2) A(x4, y2) + LAG, y1) As, y DJLA(x2, Y2) A(x4, ¥2)] 
l 
+ [A(x1, y1) A(x4, V1) TAC 2, y2) A(x3, y2)]} + 1 (— i4) fas d*y, 


x [1A0,1, y) JOAO, yr) A, Yı) A(x2, V2) A(X3, V2) A(x4, Y2) 
+ A(X1, Y2) A(x2, y1) A(X3, Y2) A(x4, Y2) + A(x1, Y2) A(x2, y2) 


x A(x3, Y1) A(x4, Y2) + Ay, y2) A(x2, V2) A(x3, Y2) A(x4, yi)}. (1.35) 


The symmetry factor 2! in the first term on the right-hand side of (1.35) can 
be understood as follows. The original factors of (1/2!)(1/4!)? in eqn (1.34) 
are cancelled by the permutation of y; and y, and by the multiple ways in 
which we can attach the fields emanating out of each vertex. However, this is 
an over-counting. Since there are two identical internal lines connecting a 
pair of vertices, we need to divide out a factor of 2!. The factor 1/2 in the 
second term of (1.35) has a different origin. It comes from the fact that an 
internal line starts and terminates in the same vertex. Eqn (1.35) is shown 
graphically in Fig. 1.2. 
Similarly, for a first-order (p = 1) two-point (n = 2) function, we have 


— i 
GPC. x2) = T | dyi, DIGAO, DIA] (1.36) 


as shown in Fig. 1.3. 
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Xi Xa X X? X] X2 


X3 X4 X3 X4 x; X4 


X Xa X2 


X3 X 


Fic. 1.2. Graphic representation of eqn (1.35). 


xy y X2 


Fic. 1.3. Graphic representation of eqn (1.36). 


Usually it is more convenient to work with Green’s function in momentum 
space 


(27)* 6*(p, +... +p, )G(p1 ... Da) = ji dfx; eT PIG (x, ...x,) 
2 (1.37) 
and with the amputated Green’s function, which is related to G™(p, ...p,) 
by removing the propagators on external lines 


i 


Gimp(P1 +» Pn) = fi R G”(P1 --- Pn) (1.38) 


where pi +p. +...p=0. In fact for spin-0 particles the amputated 
Green’s function is just the usual transition amplitude (the 7-matrix element) 
from which the cross-section can be directly computed. 


Feynman rules of A¢* theory. The result of perturbation theory may be 
conveniently summarized in terms of the Feynman rules for the transition 
amplitude. With the #7, = (A/4!)@*(x) interaction, we have the following 
prescription for calculating the N-point amputated Green’s function. 


1. Draw all possible connected, topologically distinct, graphs with N 
external lines; 
2. For each internal line, put in the propagator factor 


0) ——— —*: Ap) = 
p?—p'tie 
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For each vertex, 


(ii) —iA; 


3. For each internal momentum / not fixed by momentum conservation at 
each vertex, perform an integration f d*1/(2z)*; 

4. Each graph has to be divided by a symmetry factor S corresponding to the 
number of permutations of internal lines one can make for fixed vertices. 


1.2 Introduction to path integral formalism 


In the first section the canonical quantization procedure via operator 
formalism was briefly reviewed. We have outlined the steps through which 
the perturbative solution of an interacting field theory may be obtained in the 
form of Feynman rules. In this section the same set of rules will be recovered 
using the path-integral (PI) formalism (Dirac 1933; Feynman 1948a; 
Schwinger 1951b). This alternative quantization approach has the advantage 
of exhibiting a closer relationship to the classical dynamical description and 
the manipulation involves only ordinary functions. This allows us to see 
more clearly the effect of any nonlinear transformations on the fundamental 
variables. Thus the PI formalism is particularly suited for handling 
constrained systems such as gauge theories. 


Quantum mechanics in one dimension 


We first introduce the PI formalism in the simplest quantum-mechanical 
system in one dimension. Generalization to field theory with infinite degrees 
of freedom, together with its perturbative solution, will be presented in a later 
part of this section. 

In quantum mechanics a fundamental quantity is the transition matrix 
element corresponding to the overlap between initial and final stages 


q's qs t> = <q'be "gy (1.39) 


where the |qg)s are eigenstates of the position operator Q in the Schrodinger 
picture with eigenvalue q 


Qlq> = qlq> (1.40) 


and the |g;¢>s on the left-hand side of (1.39) denote the states in the 
Heisenberg picture, |g; ty = e'”lq>. It should be remembered that the 
Heisenberg-picture states do not carry time-dependance. The notation used 
here means that the Heisenberg-picture states |g; ty and |q’; t> in eqn (1.39) 
coincide with two distinctive Schrodinger-picture states |g(t)> and |q'(t)> at 
time ¢ and ?¢’ respectively. In the PI formalism the transformation matrix 
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element of eqn (1.39) is written as a functional integral 
r 
<q; t'lq; t> = N [raa expli | L(q, å) act (1.41) 


t 


where N is the normalization factor and L(q, q) is the Lagrangian. The 
integration is performed in the function space q(t). It represents the sum of 
contributions over all paths that connect (q, t) and (q’, t'), weighted by the 
exponential of i times the action. In the following we will derive (1.41) using 
the familiar canonical-operator formalism. The definition of the integration 
measure [dq] will be given in eqns (1.50) and (1.51) and this should clarify the 
meaning of the functional integral. 

We first divide the interval (t,t) inte n segments with space ôt = 
(t’ — t)/n. Then the transition amplitude in eqn (1.39) may be written 


(qe "engs [aa rn Pee E eae) eee lend |’ as 
Xue <qybe- gp (1.42) 


where we have inserted complete sets of eigenstates of the Schrödinger 
picture operator Q*. For sufficiently small ôt, 


<q'le~ "l> = Cq'I[1 — iH(P, Q) ôq) + O’. (1.43) 


If the Hamiltonian has the form 


2 


H(P, Q) = = + V(Q), (1.44) 


then 


(T|H(P, Q)la> 


Il 


Pe ig + q' 
la'l — la) +V ae 0(q — q') 
2m 2 
dp j Pp? qtq dp ip(q'—q) 
z <q'|p><pl lq> + 5 ) [ee 


dp ip(q’— 4) De, q+q 
= [ee mo ay: 


We have used <q’|g> = 6(q’ — q) and <g|p> = e1. Also, symmetric ordering 
of operators in V(Q) is assumed. Then 


i dp ._ p? q+q 
' iH ôt Zs 1p(q' — q) a, — +4 yV 


~ E elP(4'— 4) eTil p,a +qa'/2) (1.45) 
T 
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Thus, H(p, q) is the classical Hamiltonian. Substituting into (1.42), we have 


asp tee dp dp 
/ iH(t’ —t) ag 1 n 
<q'le >= (Z) (Ze) (dar. days 


x exp 2 |n Ga) att (,, 2A B=) I. 


i=1 : 
(1.46) 


The transition amplitude can then be written symbolically as 


t’ 


Oo dp d , 
Kg'je 91g) = [3] exp] | arrr =H (p.a)]} (1.47) 
l d dp, | 
= (E) (E) fon tadga 
x exp; 2 a p( 25) — B( px ata) h (1.48) 


The second line defines the path integral. We almost have the promised 
result of eqn (1.41) if we can perform the momentum-space [dp/2z] = 
Il? dp;/2x part of the path integral. The integrand being oscillatory, we 
analytically continue it to Euclidean space by formally treating (i ôt) as real. 
The Gaussian integral formula 


co 


ob aera ree nE l b?/4a 
| z7 e see e (1.49) 


= eo 


can then be used to obtain 


dp; —10t ,.., m \t/? im(q; — qi-1)” 
E ap er FARS a~) E & z) AE 2ôt 


In this way we have for egn (1.48) 


a m 'nj2 n-1 
, —iH(t'-t = li d i 
<q'le la> = lim & z) l] qi 


=> 0 


ee m [qi — Wi-1 . 
< exp p> a 5 ( ôt ) 7 r) (1.50) 


t’ 


<q; tlg’; t> = <q'l e7” lg) = N [tad exp | ao g= ra) | 


t 


Or 


(1.51) 
which is the stated result of eqn (1.41), where L = (mg?/2) — V(q). 
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Green’s functions of one-dimensional quantum mechanics. We will next 
translate the basic result of eqns (1.41) and (1.47) into forms that can be easily 
generalized to PI formulae for Green’s functions in field theory. 

Let us start with the simplest two-point function: the matrix element of a 
time-ordered product between ground states, 


Gt, t2) = COTO") Q"(t2))10> 


where |0> denotes the ground state. Inserting complete sets of states, 


G(t1, t2) = [ae dq’<0lq’s <4‘ ITQ”); <4; t10). (1.52) 


The matrix element 
<Olg; t> = polg) e7! = pola, t) (1.53) 


is the wavefunction for the ground state. We next concentrate on the PI 
formulation of <q’; ¢'|\T(Q"(t,)O"(t2))lg;. For ti >t, (ie, >t, > 
t, > t), we have 


<q’; ITOO" las > = <q’ eT EET eM DOS e~ Ma ~ 21g» 
p | <q) elgg >q Q" eM" g> 


x <q2|Q5 e721) dq, dq2. 


Taking eigenvalues in the Schrödinger picture and applying the basic PI 
result of eqn (1.47), it follows that 


<q’; PITO” E)”; D 


dp d n 
= || ad qı(tı)q2(t2) exp | dt[ pg — H(p, DI} (1.54) 


A minute of thought will convince us that exactly the same PI formula holds 
for the time sequence t, > t, (G.e., t > t, > t, >t). Thus eqn (1.54) is a 
general result. Substituting eqns (1.54) and (1.53) into eqn (1.52) we have 


G(t,, t2) = [ac dq’oo(q’, t)O$(g, £) [s2] qı(tı)q2(t2) 


x exp | dtl pq — Moa} (1.55) 


t 
or 


dq d 
G(ti, t2) - [SE o CVO, q (tı )q2(t2) 


t’ 


x exp | dt[ på — H(p, ot (1.56) 


t 
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The presence of the ground-state wavefunctions ¢o(q’, t’) and @$(q, t) in eqn 
(1.55) makes it clumsy to do practical calculations. To remove them, consider 
the matrix element 


V3 tlO, t2)lq; t> = {a0 dQ’<q'; '|Q’; T'Y 


x <Q"; T|O(ty, t2)Q; T)<Q; Tq; t> (1.57) 


where O(t,, t2) = T(O"(t,)O"(t,)) and t' > T' > (ti, t2) => T >t. Let |n> be 
the energy eigenstate with energy E, and wavefunction @,(q), 


H\n> = E,\n> 
<qin> = On (9). 


Then we have 
cq’ t's T = <q'le™ MIO’) = a nale eg} 
=) rP) eH". (1.58) 


To isolate the ground-state wavefunction in this equation, we use the fact 
that E, > Eo for all n 4 0, and take the limit t + —100, which yields 


lim <q’; tQ’; T> = PPQ) e75! e"o. (1.59) 
Similarly, 
lim <Q; Tig; t> = P(O) e75" e-¥07, (1.60) 


Then eqn (1.57) becomes 
lim <q’; t'|O(t,t2)lq; t> = fao d0'$3(7')b0(Q')KQ'; T'|O(t,12)|0, TY 


ae 
x p*(O)d,(qg) e7 Eolt'l etiEoT e iEoT a Eolll 
= £8(q')ho(g) e75"! e7 “NG (t,, t2) (1.61) 
where we have used eqn (1.52). From eqns (1.59) and (1.60) it is clear that 
lim <q';t'lq;t> = plg polg) e7! e75. (1.62) 


Combining eqns (1.61) and (1.62), we obtain for Green’s function 


G(ti,t,)= lim eer oet: | 
<q’; t'lq; t> 


t> -io 
t >10 


| dq 2| 
= lim ———— | | —— la(tı)q(t2) 
Jim ata || TE aa 


t’ 


x exp] | rtp — H(p, D} (1.63) 


t 
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where we have used eqn (1.54) and the path-integral representation for the 
factor <q’; t' |q; ty in the denominator is given in eqn (1.51). This clearly can 
be generalized to the n-point Green’s function 


G(ti .-- ty) = <O|T(q(ts alta)... Gta )NOD 


= 1 dq dp 
a nie Q5 TI; > Plas [52 faco- Qt) 


t’ 


x exp | dt[ pq — H (p, a (1.64) 


t 


This entire set of Green’s functions can be generated as follows. 


(—1)" ð WLI] 
OJ(t,)... OS(t,) 


G(t,...4,) = (1.65) 


J=0 


WJ] = lim aes || 2| 
r> vio (G3 tlg; t> 2n 


t 


x expli | dt — H(p, q) + roo} (1.66) 


t 


with 


Comparing this expression for W[_J] with the Green’s function in eqn (1.64), 
we see that the generating functional W[J] corresponds to the transition 
amplitude from the ground state at ¢ to the ground state at ¢’ in the presence 
of an external source J(t), 


WL J] = <010>, (1.67) 


with the normalization W[0]=1. Thus the computation of Green’s 
functions is now reduced to the computation of W[J]. We will see later in the 
case of quantum field theory that the J(t)-independent factor <q; t|q’; t’> in 
eqn (1.66) is irrelevant for generating the connected Green’s functions and can 
be neglected. 


Euclidean Green’s function. In the formulae for Green’s function (eqns (1.63) 
and (1.64)) the unphysical boundary condition t’ + —1i00, t > 100 should be 
interpreted in terms of the ‘Euclidean’ Green’s functions which are defined by 


Sti, 25 tp) = "Git, ..., —it,). (1.68) 


The generating functional for the S-function is then given by 


W:[J] = lim im | [dq] exp} ae -7 (st) - Vig) + Jeu" | 


(1.69) 
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with 
o"W,LJ] 


St, aT) = 5J(t,)... d(T) 


J=0 


The unphysical limits t’ + —ioo, t—ioo make sense in ‘Euclidean 
space’ where ¢ is replaced by —it. Furthermore, the path integral in 
Euclidean space (eqn (1.69)), is well-defined for those potentials V(q) which 
are bounded below. This is because we can always readjust the zero point of 
V(q) such that 


dq V 
(S) + V(q) > 0 (1.70) 


and the exponential in (1.69) will always give a damping factor so that the 
path integral converges. Note that (1.70) is satisfied for a physically stable 
system. 

Thus, the path-integral formalism has well-defined meaning only in 
Euclidean (or imaginary-time) space. To obtain physical quantities in real 
space, we have to do an analytic continuation. In practice, we will just do the 
manipulations in real space with the understanding that they can be justified 
in Euclidean space. 

Let us summarize the discussion of the PI formulation of the quantum- 
mechanical description of a one-dimensional system. The basic results are the 
functional-integral formulae for the transition amplitude of eqns (1.41) and 
(1.47). In preparation for generalizing the formalism to field theory we have 
derived from these results the n-point Green’s functions in (1.64). All these 
Gt, ...t,)8 can be generated from W[J], the ground-state transition 
amplitude in the presence of an external source J. This central quantity can 
be computed according to (1.66) with an obvious generalization to systems 
with N degrees of freedom as 


N 
WUJ, ..., Jv] ~ lim | [ | [dq; dp] 


t'+-—io J i 
t— 100 
t 


x exp} | aY Pdi — H(p: gi) + & Ja} 


t 
or 


WUJ, ... Jy] ~ lim i [dq;] 


t= —i00 
t— 10 


l 


t’ 


N 
x expli | i ra qi) + 2 sa |} (1.71) 


Field theory 


We consider a field theory as a quantum-mechanical system with infinite 
degrees of freedom and make the following identifications for the results 
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presented above 


II [dq; dp;], > [do(x) dz(x)] 


L(qi, åi), H (qi, Pi) > [axs ($, ô„ġ), [arxae ($, 7) (1.72) 


with x(x), L(x), and #(x) being the conjugate momentum field, the 
Lagrangian density, and the Hamiltonian density, respectively. The ground 
state in field theory is generally referred to as the vacuum state. Thus the 
generating functional W[J] is the vacuum-to-vacuum transition amplitude 
in the presence of an external source J(x). The generalization of eqn (1.71) 
takes the form 


WUJ] ~ [ra dr] exp} | axan) Ooh(x) — Æ (n, $) + J wol (1.73) 


or 
WIJ] ~ | [do] exp} | d*x[ (p(x) + Joon. (1.74) 


Furthermore, the limit tf > ico in (1.71) suggests that we first calculate the 
Euclidean-space quantity W,[ J], which is the analytic continuation of W[ J] 
with x, = (t = it, x) replacing x, = (t, x). 


WL J] ~ | [de] exp) fasts (P(x)) + soca} (1.75) 


For field theory what we are interested in is the connected Green’s function 
which is related to the generating functional by 


l O"WeLJ ] l 


SS ee ee (1.76) 
WeLJ] OJ (X1)... J (Xn) 


G(X 54555 Xp) -| 


J=0 


Thus in order to remove the disconnected part of the Green’s function, an 
extra factor of W[J] has been inserted in the denominator of the definition 
(1.69). We recall that the same division was involved in our previous 
discussion of Green’s function (eqns (1.30) and (1.31)). The important 
practical consequence of this division is that the J-independent absolute 
normalization of W[ J] is immaterial for any subsequent calculation of the 
Green’s function. 
We now return to our illustrative example of 1¢* theory 


L (0) T Holp) + Llo) 
with 
LAP) = APAP) — 2u 7 


—À 
L) = gyi 
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The Euclidean generating functional 


1/od\? 1 l À 
WL J] = [ae exp) — fas; (3$) + 5 (Vp)? + 5 pep? + zi pt + 7| 
(1.77) 


may be written as 


Ô 
WEJ] = È [ie (3) WoLJ J (1.78) 
where | 
WLI] = [a exp| faxez, + s)| 
is the free-field generating functional. (For simplicity of notation we drop the 
subscript E and the bar over x indicating Euclidean space.) The factors 


—(0@/dt)? — (Vp)? in eqn (1.77) can be replaced by ¢(8?/ðt? + V7)d 
because the difference is a total four-divergence and we have 


l 
WoLJ] = {tae "E [ats d*yh(x)K(x, yp) + | aJe | (1.79) 


where 


2 
K(x, y) = sa- (- 2a- Vv? te) (1.80) 


As x and y may be taken as ‘continuous indices’, W,[ J] of eqn (1.79) can be 
considered an infinite-dimensional (N — oo) Gaussian integral of form 


fas, ... dOn exp| - D iK; + A 


l l 
Sooo — J. K- 1 ° J. zá l . 8 l 
/det K exp] 5 i( dij A ( ) 
The right-hand side is obtained by a generalization of the result cited in eqn 


(1.49). In this way the @ functional integral in (1.79) can be performed and we 
obtain, up to an inessential multiplicative factor, 


MLJ] = expl > [ats d*yJ(x) A(x, 40 | (1.82) 
where A(x, y) should be the inverse of K (x, y) in (1.80). Thus, 
[asko y) AQ, z) = 6*(x — 2). (1.83) 


It is not difficult to see that 


dtk elk ~) 
A(x, y) = | Se oso (1.84) 


where «x = (ikọ, k) forms a Euclidean momentum four-vector. The perturba- 
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tive expansion in powers of 4 of the exponential in (1.78) gives 


WEJ] = WoLJ]{1 + Aol] + Po [J] +...) (1.85) 


ô 4 
wı[J] = -2 Wo UJ] nile bs, l MLJ] 
e ô 4) 2 


l 5 F 
= -zay We alfe fe Dorn (1.86) 


When we plug in the explicit form (eqn (1.82)) for W,.[J], we obtain 


where 


wı[J] = 7 [A(x, y1) AQ, y2) AQ, y3) Alx, Va) IO 1) IO'2)J3) Ia) 


+ 3! A(x, y1) A(x, y2) A(x, x)J1)JO2)I, (1.87) 
and 


w2[ J] = 5 ots] 


+ san A(x, yı) A(x, ya) A(x, y3) A(x, X2) A(x2, Ya) 


x A(X2, Y5) A(X2, Ve IV) IO 2) IV3) JVs) IVs) IJV) 


3 
+ 4y A(x1, Y1) A(x1, y2) A7(x1, x2) A(x2, ¥3) A(x2, Y4) 


x IVD)IV2)IV3 IW) + 55 E Ae y1) AC, x1) A(x, X2) 
x A(xX2, Y2) A(X2, Y3) AX2, Va) IO1)IW2) JO3) Ja) 


l 

+ 8 A(x,, yi) A(x}, x1) A(x,, X3) A(x, X2) A(x2, ¥2)I1)IO2) 
l 

+ 8 A(x, yı) A? (x4, X2) A(x3, X2) A(x;, ¥2)IO1)I02) 


+ AGr, y1) AG, x2) Ala, 22002) (1.88) 


where we have dropped all J-independent terms (see Figs. 1.4 and 1.5). It is 
understood that all arguments (x;, y;) are integrated over. 

It is clear that the first factor on the right-hand side of (1.88), 402, 
corresponds to a disconnected contribution. For the connected Green’s 
function defined by (1.76) 


ô" In WL J] 


(n) E a 
G™(X1 Xn) 0J(x,)... ÔI(Xa)lJ=0 


(1.89) 
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J2 Y4 
FIG. 1.4. Graphic representation of w, in eqn (1.87). 


Y] J4 
x x 
J2 : Js 


J3 Y6 


Ji J3 J2 
x) 
J2 Y4 Y4 


X? x} X2 
| | | Jı J2 
X xX X y Jı Xi J2 


Fic. 1.5. Connected parts of œw, in eqn (1.88). 


such terms would not contribute. To see this explicitly, 
In WEJ] = In WEJ] + Inf + Wo LJ WL] — WUJ] 
= In WJ] + Infl + Wo Le?” — 1)WoLJ]}, (1.90) 


where we have used (1.78). Since W5 ‘(eS*” — 1)W, is also small, we can 
expand the exponential as well as the logarithm. Thus from eqn (1.85) 


In WEJ] = In WJ] + (4w; + A720, +...) -Hw + 120, +...)% +... 
= In MLJ] + Aw, + A*(@, — 57) +.... (1.91) 


Thus the disconnected $m? in œ, is in fact cancelled. It is not difficult to 
generalize this, to prove that all disconnected contributions disappear in 
In WL J]. 

We note that what corresponds to Wick’s theorem is simply the rule for 
functional differentiation 


dJ(y) Ô 
SJO) = 5a) { 100) 86 — y)dx 
= 0(x — y). (1.92) 


Differentiation according to (1.89) finally yields the (Euclidean) Green’s 
function. For example, the terms with four Js in (1.87) and (1.88) give rise to 
the first- and second-order four-point functions. These results are the same as 
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those in (1.33) and (1.35) (with the propagator given in eqn (1.14)) except 
that they are valid in Euclidean space. 

The analytic continuation x, > x, and x, > k, of A(x — y) in (1.84) yields 
the familiar Feynman propagator (1.14) 


ee [dik eo he” 

A(x — y) > iA,(x — y) = i (Sere 
The ie factor in the denominator indicates how the boundary condition on 
the propagator is to be imposed. It corresponds to the addition of a died? 
term in the Lagrangian and hence provides a.suitable damping factor for the 
path integral (1.77) in Minkowski space. 

Clearly the same set of Feynman rules, which we briefly reviewed in §1.1, 
follow from PI quantization formalism. The reader is referred to Appendix B 
where a practical guide to the derivation of the Feynman rules is given. 


1.3 Fermion field quantization 


Here we discuss the quantization procedure for systems with fermions. After 
briefly reviewing the canonical formalism, we indicate how the correspond- 
ing path-integral quantization can be formulated (see, for example, Berezin 
1966). This involves the subject of Grassmann algebra. 


Canonical quantization for fermions 


In §1.1 we reviewed the canonical quantization procedure for a scalar field. 
Bose-Einstein statistics follow naturally from the commutation relations of 
the particle creation and annihilation operators (1.11), i.e. from the 
commutation of scalar field operators (1.3). For a many-fermion system, in 
order to arrive at an exclusion principle the field operations must satisfy a set 
of anticommutation relations. Consider the case of free Dirac field, 


L(x) = W(x)(iy" 6, — myx). (1.93) 
Eqn (1.2) yields the Dirac equation 
(iy 0, — my (x)= 0. (1.94) 


In quantum theory, the field w(x) and its conjugate momentum z(x) = 


iy (x) are postulated to be operators; they satisfy the canonical anticommu- 
tation relations 


{ w(x, t), w(x’, t)} = 6°(x = x’) 
(W(x, t), Wx’, 2)} = (W(x, t), W(X’, 2} = 0 
where {A, B} = AB + BA. Following the same steps as in the scalar case of 


§1.1 we formally solve the Dirac equation and calculate the Feynman 
propagator function 
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ISe(X1 — X2)ag = COLT (W(X 1) g(X2))10> 
dtk i —ik+(x;— x2 
a er 099 


For interacting systems the perturbative solution in the form of Feynman 
rules can be developed, again much in the same manner as the scalar case of 
§1.1. We shall not repeat the steps here except noting that a consequence of 
the anticommutation relation is that there will be a minus sign for each closed 
fermion loop in a Feynman diagram. 


Path-integral quantization for fermions 


Quantization of the fermion system can also be carried out by expressing the 
transition amplitude directly as the sum over all possible world lines 
connecting the initial and final states. The generating functional is then 


Win, 1] = | [dy(x)][dy(x)] exp} farce (WW) + on + w} (1.96) 


where w(x), w(x), n(x), and 7(x) are (classical) fermion fields and sources, 
respectively. While the sum over the path for a boson system is a functional 
integral over ordinary c-number functions (classical scalar fields), the 
functional integral in (1.96) must be taken over anticommuting c-number 
functions (‘classical’ fermion fields) 


(W(x), WO} = W, WX)} = (WX), V = 0 
(n(x), m(x')} = (nx), AD = A), H(x')} = O. 


Thus they are elements of Grassmann algebra. In the following section we 
shall provide a brief introduction to this subject. 


Grassmann algebra 


In an n-dimensional Grassmann algebra, the n generators 6,,0,,..., 0, 
satisfy 


{6;,6;$ =0 iij=1,2,...,2 (1.97) 
and every element can be expanded in a finite series 


p(0) = Po + PIPO, + PH) 6,0, +... + PO 


iyi2 


Oi, ...8; 


etn, n 


where each of the summed-over indices /,, i,,..., i, ranges from 1 to n. The 
expansion terminates because of (1.97). We shall now discuss the subject of 
differentiation and integration in such an algebra. Before stating the general 
n-dimensional results, we first motivate them with the simplest case of one 
Grassmann variable, 


{0,0 =0 or 0 =0. (1.98) 
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Thus any element of the algebra has the simple expansion 
p(@) = Po + OP. (1.99) 


If we take p(@) to be an ordinary number, then P) and P, are ordinary and 
Grassmann numbers, respectively. (We can imagine embedding this one- 
dimensional Grassmann algebra into a higher-dimensional one so that we 
would have more than one anticommuting element.) 

The operation of differentiation may be taken from left or right with the 
basic definition 


ay: eee | eee N 1.10 
40 0=0 qa l (1.100) 
We have the ‘left derivative’ 
d 
— p(0) = P 1.101 
40 p(@) = Py (1.101) 
and the ‘right derivative’ 
d 
p(@) o~ — P, (1.102) 


because (dP,/d0) = 0 and (d/d0) anticommutes with P,. 

We next introduce the integration operation, which ordinarily is taken to 
be the inverse of differentiation. However such an inverse is ill defined in a 
Grassmann algebra, as can be seen by the fact that, for either type of 


derivative, 
2 


d 

qg rO = 0. (1.103) 
Thus we must be content with a formal definition of the integration 
operation which preserves some general properties of our intuitive notion of 


integration. We require it to be invariant under a translation of the 
integration variable by a constant. Thus 


| dôp(0) = [a0 + a). (1.104) 
From (1.99) we must have 
[aor =Q or fas =0 (1.105) 


where « is another element in the Grassmann algebra which is independent of 
0 and anticommutes with 0. We can normalize the remaining integral using 


{aoe =|. (1.106) 
From (1.105) and (1.106) it follows that 


[ado = P, (1.107) 
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which is the same as left differentiation in (1.101). Thus our definitions of 
integration and (left) differentiation lead to the same result 
d 
[aro = qg PO SP (1.108) 


We next consider the problem of change of integration variable 
0 + 6 = a + b0, where a and b are anticommuting and ordinary numbers, 
respectively. For an ordinary c-number we have the familiar relation 


| dzf(x) = | dx (=) FR). (1.109) 


What will be the corresponding result for Grassmann numbers? Since, by 
(1.108), 


d 
| 000) = qg PO) = P, (1.110) 
and 
[a0 = [aover, = bP,, (1.111) 
we have 
d6\~} 
| d6p() = | dé (a) pO). (1.112) 


Thus for anticommuting numbers the ‘Jacobian’ is the inverse of what we 
would ordinarily expect. 

We now proceed to generalize our one-variable results of (1.101), (1.102), 
(1.105), and (1.106) to the n-dimensional Grassmann algebra. We have the 
‘left derivative’ 


d 
qg (9182 «++ On) = 5in82 «Oy — 5:2818 -On + ++. (= 1" Sin + Oni 


and the ‘right derivative’ 


pam 


d 
(0,6, son's On) a5 = OinO 1 eee 0-1 — Fenn e + (—1)""! 01,95 eee 0,- 


Thus, to calculate the left (right) derivative (d/d0;) of 0102, ..., 8,, commute 
6; all the way to the left (right) in the product; then drop that 0;. The symbol 
dé,, d@,,..., d0, is introduced with the conditions 


and 
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which defines the integration operation. For a change of integration variable 


5. = b,.0. (1.114) 


J J? 


we have the generalization of (1.112) 


~ 


=] 
{a6 ... db, p(ð) = {a ... d0; fae A p(6(@)). (1.115) 
To show this result we follow the same steps as in the one-0 case. Just as in 
(1.110) and (1.111), we need to compare [d0,...d6,p(6) and 
f d0, ...d6,p(6(@)). The only terms in p(0) which can contribute to these 
integrals are terms with n 6s, 


Oisen On = biria Dy 8, «8... (1.116) 
The right-hand side is non-zero only if i,,..., i, are all different and we can 
write 
ði... 8, = bin. bnini.. i01 On 
= (det b)0; ... 0,. (1.117) 


However, in order to maintain the normalization conditions (1.113), we must 
have 


dð, ... dð, = (det b)+ d0; ... d0,; (1.118) 


hence the result of (1.115). To repeat, for anticommuting variables inte- 
gration is equivalent to differentiation and we get [det (dĝ/d0)] 7! rather 
than [det (dĝ/d0)]. 

As we have seen in §1.2 the Gaussian integral plays an important role in 
the PI formalism. Thus we need to evaluate 


G(A) = | do, ... d0, exp(4(6, A0)) (1.119) 


0.. First consider the 


where A is an antisymmetric matrix and (0, 40) = 0;4;;0;. 


simple case of n = 2 
0 A 
A = 12 
— Á> 0 


G(A) a [as d0, exp(6,6,A 12) 


and 


= [as dé,(1 + 0102412) 
= Ai, = Jdet A. (1.120) 


For the general case where A is an n x n antisymmetric matrix, we can first 
put A in the standard form by a unitary transformation. (Here n is taken to 
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be even as the integral vanishes for odd n.) 


UAU' = A, (1.121) 
with 


A. a 3 l (1.122) 
| ai 36 


This can be seen as follows. Since i4 is hermitian, it can be diagonalized by a 
unitary transformation V 


ViA)Vt = A, (1.123) 


where A, is real and diagonal with diagonal elements which are solutions to 
the secular equation 


detlid — Al| = 0. (1.124) 


Since A? = — A, we have det|iA — AI|’ = det|—iA — AI| = 0.Thus, if A is a 
solution, so is (—A), and A, is of the form 


Ace b i (1.125) 


To put A, into the standard form of (1.122), we use the 2 x 2 unitary matrix 


S as (1.126) 
“m oN i 
which has the property 


fi o 0 1 
so HEEE J) (1.127) 


Thus S(—iA,)S' = A, for 


S= y (1.128) 


and the unitary matrix in (1.122) must be the product U = SV because 
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(SV)A(SV)' = S(—iA,)S' = A,. Furthermore, let 


qi! 


Thus, 
det(T7 +) = ydet A. 
We can then write 


T(UAU®)T = TA,T = A, 


The Gaussian integral (1.119) can then be written as 


G(A) = | do, ... d0, exp&(0, U'T "A, T + UO). 


Change the integration variable 
ĝ = (T71 U)0 
and use (1.115) 


G(A) = | dĝ, ... dd, exp(4G, 4,8) ae $5) 


dð 
= a(S) : 


Since, by (1.133) and (1.130) 


ĝ 
a(S) = det(T~'U) = det(T~*) = ydet A, 
we obtain the result 


G(A) = | d0, ... d0, exp(4(6, 46)) = det A 


(1.129) 


(1.130) 


(1.131) 


(1.132) 


(1.133) 


(1.134) 


(1.135) 


(1.136) 


which should be contrasted with the Gaussian integral with ordinary 
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commuting real variables 


dx, dx, i l 
TF —3(x, Ax)) = ——— 1.137 
| Tag Jag P(A 49) = GG (1.137) 
or with ordinary commuting complex variables (z = x + iy) 
dz, dz, dz¥ dz* Š l 
a ee a -(z*, Az)) = — 1. 
| Tat oe ag te eX EAN) = FG (1.138) 


where fdz dz* = {dxdy. The Gaussian integral for complex Grassmann 
variables can be shown to have the value 


|an, dð, ... d0, dð, exp(ð, 40) = det A (1.139) 


where 0; and ĝ; are independent generators of the algebra. 

The classical fermion fields w(x) and W(x) are then taken to be elements of 
an infinite-dimensional Grassmann algebra. All the above results for the 
general n-dimensional case can be naively extended. 

Since the fermion fields always enter the Lagrangian quadratically 
Lf = (WA), the functional integral of (1.96) will be a generalized Gaussian 
integral. The result in (1.139) can then be applied 


W = [ravon [dy(x)] exp) [atay 


— det A (1.140) 


where we have not bothered to display the source fields. W is the vacuum-to- 
vacuum amplitude and the (connected) Feynman diagram representation, as 
generated by In W, will be a set of single-closed-fermion-loop graphs (Fig. 
1.6). The change of going from the ordinary functional integral (1.138) to the 
anticommuting variable functional integral (1.140), with the replacement of 
(det A)~' by (det A), corresponds to changing the overall sign of In W. This 
is the familiar Feynman rule of an extra minus sign for each closed fermion 


~ OOO 


Fic. 1.6. Vacuum-to-vacuum amplitude as represented by single closed loops. 


2 


Introduction to renormalization 
theory 


Given any quantum field theory one can construct the Feynman rules for 
calculating the Green’s functions and S-matrix elements in perturbation 
theory as described in Chapter 1. But in relativistic field theory one often 
encounters infinities in the calculation of diagrams containing loops. This is 
because the momentum variable in the loop integration ranges all the way 
from zero to infinity. In other words, for a relativistic theory, there is no 
intrinsic cut-off in momenta. These divergences will render the calculation 
meaningless. The theory of renormalization 1s a prescription which allows us 
to consistently isolate and remove all these infinities from the physically 
measurable quantities. It has been of utmost importance to the development 
of relativistic quantum field theory. 

It should be emphasized however that the need for renormalization 1s 
rather general and is not unique to the relativistic field theories. 
Renormalization has its own intrinsic physical basis and is not brought about 
solely by the necessity to expurgate infinities. Even in a totally finite theory 
we would still have to renormalize physical quantities. The following 
example should illustrate this point. Consider an electron moving inside a 
solid. Due to the interaction of the electron with the lattice, the effective mass 
of the electron m*, which determines its response to an externally applied 
force, is certainly different from the mass of the electron m measured outside 
the solid. The electron mass is changed (renormalized) from m to m* by the 
interaction of the electron with the lattice in the solid. In this simple case one 
can in principle measure both m* and m by switching on and off the 
interaction (i.e. by placing the electron inside or outside of the solid). Clearly 
the difference is finite since both m and m* are finite and measurable. For the 
relativistic field theory, the situation is the same except for two important 
distinctions. First, renormalization due to the interaction 1s generally infinite 
(corresponding to the divergent loop diagrams). Second, there is no way to 
switch off the interaction; hence quantities in the absence of interaction, 
called the unrenormalized or the bare quantities, are not measurable. For 
example, in quantum electrodynamics the difference between the bare 
electron mass m and the renormalized mass m* is infinite, and the bare mass 
cannot be measured because the electron interacts with the virtual photon 
field constantly and there is no way to turn off this interaction. 

The programme of removing infinities from physically measurable quan- 
tities in a relativistic theory, the renormalization programme, involves 
shuffling all divergences into the bare quantities. In other words, the 
unrenormalized quantities are assumed to be appropriately divergent to 
begin with and the infinite renormalization due to interaction then cancels 
these divergences to produce finite renormalized quantities. We should recall 
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that in a relativistic quantum field theory the renormalized quantities are 
physically measurable while the bare ones are not. This difficult programme, 
as originally formulated for quantum electrodynamics by Feynman (1948)), 
Schwinger (1948, 1949), Tomonaga (1948), and Dyson (1949), has been 
quite successful and in the case of QED the agreement between theory and 
experiment has been spectacular. 

Technically the theory of renormalization is rather complicated. A detailed 
and thorough discussion of this subject is beyond the scope of this book. In 
this chapter we shall explain the principal ideas behind it and give examples to 
illustrate how it works. 


2.1 Conventional renormalization in 4¢* theory 


We shall first use the simple 4¢* theory as an example to illustrate the 
renormalization procedure. The Lagrangian density is separated into free 
and interacting parts 


L= Lit FL; (2.1) 
with 
Ly = ALOON — 15606] (2.2) 
and 
AG 46a 
fF = — Gy 0: (2.3) 


The propagator and the vertex of this theory are displayed in Fig. 2.1. 


i 
SO —iA 
p — pi tie x £ 


Fic. 2.1. 


We will concentrate on the one-particle-irreducible (1P1) diagrams. They 
are the Feynman diagrams which cannot be disconnected by cutting any one 
internal line. Correspondingly, we define the one-particle-irreducible (1PI) 
Green’s functions, denoted by I'(p,...p,), which have contributions 
coming from 1PI diagrams only. For example the graph in Fig. 2.2(a) is a 1PI 
diagram while the one in Fig. 2.2(b) is not. The reason for selecting 1PI 
diagrams is that any one-particle-reducible diagram can be decomposed into 
1PI diagrams without further loop integration, and if we know how to take 
care of the divergences of 1PI diagrams we will also be able to handle the 


Fic, 2.2. 
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reducible diagrams. For example, the two-point Green’s function 
(propagator) 


iA(p) = [ats e7? *C0|T (Po(x)ho(0))]0> (2.4) 


can be decomposed in terms of the 1PI self-energy parts X(p) as in Fig. 2.3. 


Fic. 2.3. The propagator as a sum of 1PI self-energy insertions. 


Then we can write the propagator as 


i i 
TNC) ee ne SEE ET) (| eee 
(p) p — pe + ie a T 1d(p oe sey aire 


1 | 1 
=o. os. | G 
p“ — Ho + ie NE 1 | 
1+12 —— 
D) a tie 
i 


s ost 2.5 
p> — u — (p°) + ie (2-9) 


Clearly if we can make the proper self-energy part X(p7) finite, the propagator 
A(p) will also be finite. 

Since there is no divergence in the tree (zero-loop) diagrams, we begin our 
calculation with the one-loop 1PI graphs. It is not difficult to see that Figs. 
2.4 and 2.5 represent an exhaustive listing of all the one-loop divergent 1PI 
diagrams in this 1g* theory. Fig. 2.4 is the self-energy graph 


iño | d4l i 


a a E VE E E 
ARS) 2 J] (2t P — u + ie 


(2.6) 


The factor of 1/2 is the symmetry factor, of which some examples were given 
in §1.1. Or we can deduce it directly from the fact that there are 4:3 = 12 


FIG. 2.4. 


P 4 
3 p j; i 
P; P4 
Pi P3 
P; p; Ri P2 
(a) (b) (c) 


FIG. 2.5. 
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ways to ‘contract’ o(x,)@(x,) into the interaction term #*(x) and this does 
not completely compensate for the 1/4! factor in (2.3). The integral in (2.6) is 
quadratically divergent. Fig. 2.5 shows the vertex corrections with contri- 
butions given by 


(—id,)? f dtl i i 
T= Tp") =) = 5 lor (=e eee, 
-rA T,=Tw (2.8) 


where 
s=p?=(p,t+p2), t=(pi-p3)?, u=(pı- p4? (2.9) 


are the Mandelstam variables. The contributions in (2.7) and (2.8) diverge 
logarithmically. 

In the renormalization programme one first introduces some appropriate 
regularization schemes so that all divergent integrals are made finite. We 
are then free to manipulate (formally) these quantities, which are divergent 
only when the regularization is removed (e.g. by letting the cutoff approach 
infinity) at the end of the calculation. The commonly used regularization 
schemes will be discussed in §2.3. In the meantime it should be understood 
that by divergences we mean the regulated divergent quantities which are 
finite and cutoff-dependent. 

For any divergent diagram we will first separate the divergent part from 
the finite part, then absorb the divergences in some appropriate redefinitions 
of mass, coupling, and field operators. To make the separation one uses an 
important property of the Feynman integrals given in (2.6) and (2.7): if one 
differentiates the divergent integral with respect to the external momenta, 
this increases the power of the internal momenta in the denominator and 
makes the integral less divergent. (These are examples of the ‘primitively 
divergent’ diagrams—for further discussion, see §2.2.) Therefore, when 
differentiated a sufficient number of times, the result is completely con- 
vergent. For example, if one differentiates F'(p?) with respect to p?, one finds 


a | 

ape = a r(p°) 
A2 f d*i (l— p)-p ] 
m TE T E 2.10 
lop a a aa aa A 


which is finite. This means that the divergences will reside only in the first few 
terms of a Taylor series expansion in external momenta of the Feynman 
diagrams. For example, the Taylor expansion I'(p”) around p° = 0 is of the 
form 


l 
T(p°) = ao + a,p? + o Oy Dp?) +... 
where 


a Di i . (2.11) 


n an 2 
— A" e 
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The a,S are finite for n > 1 and only ay contains the logarithmic divergence. 
We can sum up all the finite terms and write 


T(s) = F(0) + Fs) (2.12) 
where I'(0) = ay is divergent and [(s) is finite with the property 
Î(0)=0. (2.13) 


These functions T (0) and ÎÑ(s) will be calculated explicitly in §2.3. Note that 
the finite part I'(s) is just the original I'(s) with its value at s = 0 subtracted 
out. Hence this procedure is sometimes referred to as the subtraction. 

In the following discussion we shall use the Taylor expansion of (2.6) and 
(2.7) to separate the divergent part from the finite part and absorb the 
divergent parts into redefinitions of the bare quantities. 


Mass and wavefunction renormalization 


The self-energy contribution given in eqn (2.6) is quadratically divergent. But 
this one-loop contribution has the peculiar property of being independent of 
the external momentum p. Hence the Taylor expansion is trivial; i.e. 
X(p*) = (0). This is true only for the one-loop approximation in 1¢* 
theory. For example the two-loop self-energy diagram in Fig. 2.2(a) is 
quadratically divergent and has a non-trivial dependence on p°. Thus in 
general the Taylor expansion in external momenta around some arbitrary 
value u? will have two divergent terms 


X(p?) = Elp’) + (p° — PE?) + Èp?) (2.14) 


where (u?) is quadratically and Z'(yu?) logarithmically divergent, as each 
differentiation with respect to the external momentum 0/dp, decreases the 
degree of divergence by one unit and X’(u*) can be written in the form 
3(0/dp,)(0/0p")=(p*)|,2-,2. Note that in general a quadratically divergent 
diagram will have three divergent terms with quadratic, linear, and 
logarithmic divergences. But in X(p”) there is no linearly divergent term 
because a term proportional to p, is not Lorentz invariant. The last term in 
(2.14) is finite and has the properties 


(u?) = 0, (2.15) 
x'(u7) = 0. (2.16) 


Of course in the one-loop approximation £'(p?) = (p?) = 0 for all values 
of p*. But in general the self-energies do not vanish. Substituting (2.14) into 
the expression for the full propagator in (2.5), we have 


i 
p? — we — (p?) — (p? — u’) — X(p?) + ie 
The physical mass is defined as the position of the pole of the propagator. 
Since up to this point u? is arbitrary, we can choose it to satisfy the equation 


pò + X(u*) = we. (2.18) 


iA(p) = (2.17) 
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Then 
i 


Using (2.15) one sees that A( p?) has a pole at p? = u’. Thus px? is the physical 
mass and is related to the bare mass through eqn (2.18). This is the mass 
renormalization. Since X(u7) is divergent, the bare mass u must also be 
divergent so that the physical mass p? is finite. To remove the divergent term 
E'(u?) we note that both L’(u?) and X(p”) are of order A, (again keep in 
mind that all divergent quantities are regulated to be finite); we have 


X(p?) = [1 — Z'(u?)]X(p’) (2.20) 


and the propagator function can be written as 


iA(p?) = (2.19) 


iZ 
IiA(p?) = e e e L e 2.21) 
ue pP? — K —X(p?) + ie aa 
where 
Z (1 E(u] t = 14+ Dw?) + 00/2). (2.22) 


In this form the divergence is a multiplicative factor and can be removed by 
rescaling the field operator ¢@,). More specifically, if we define the re- 
normalized field @ by 


b = Z P bo, (2.23) 


then the renormalized propagator function given by 


iâr(p) = [ats e~? *<0IT (P(x) (0))|0) 


= Z% [ats e~ P: *£0[T(Po(x)o(0))|0> 


i 
= ——_._ HIS A 2.24 
p-p ipti T S MP) _ 


is completely finite. Z, is usually referred to as the wavefunction renormaliza- 
tion constant. In this way, the divergences in self-energy are removed by 
mass renormalization (2.18) and wavefunction renormalization (2.23). 

The renormalized field @ given in eqn (2.23) defines the renormalized 
Green’s functions Gt) which are related to the unrenormalized ones by 


GR(X1 Xn) = CO|T(P(%1) .-- O(%,))10D 
= Z5"*<0|T(bo(x1) --- Po(%n))10> 
= Za Gy ise %,) (2.25) 


Or, in momentum space, 


GRP ++» Pn) = Ze" GO(P1 - «+ Pn) (2.26) 
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where 
(27) 6*(D1 See ee a a Pn )GL(p, oe . Pa) = (i dx; we) 
i=1 


x GM(x,...%,) 227 


(2n)* Stp +... + PxdGI(Ps -+ Pp) = | (iI axe" 


. 


i=1 
x G(x]... Xp) (2.28) 


To go from the connected Green’s function given in (2.26) to the IPI 
(amputated) Green’s function, we have to eliminate the one-particle 
reducible diagrams, and also to remove the propagators for the external 
lines in 1PI Green’s functions, i.e. remove the Ap(p;)s from G” (p; ...Dn) and 
the A(p;)s from G” (p; ...P,,). Since Ap(p) and A(p) are related by 


Ax(p;) = Zg * A(p:), (2.29) 
the renormalized and unrenormalized 1PI Green’s functions are related by 
TR(P1 «+» Pa) = ZET 91 --- Pn). (2.30) 


Coupling constant renormalization 


We now proceed to renormalize the 1PI four-point function of Fig. 2.5. 
From eqns (2.7) and (2.8), this unrenormalized Green’s function is given, to 
order 12, by 


(5s, t, u) = —ifo + (5) + FO) + TW) (2.31) 


where on the right-hand side the first term is the tree-graph contribution and 
the last three terms are the one-loop contributions which are divergent. We 
want to absorb these divergences by a redefinition of the coupling constant. 
How is the coupling constant measured in Ad* theory? Since the basic 
vertex involves four particles, it would be natural to define the coupling 
constant in terms of the two-particle scattering amplitude, which is physically 
measurable. But for the discussion of the renormalization, it is more 
convenient to define the coupling constant in terms of the closely related 
renormalized 1PI (amputated) four-point function I'¥’'(p,,..., p4). Since 
Te is a function of the kinematical variables s, t, and u (i.e. it is not a 
constant), some particular point in the kinematical region has to be chosen to 
define the physical coupling constant. Remembering that for particles on the 
shell p? = u? these variables satisfy the relation s + t + u = 4”, one may 
choose, as a convention, the symmetric point, 
2 


So = lo = Uo = 3 (2.32) 


to define the coupling constant. Thus, 
PR(S0, to, Uo) = —ià (2.33) 


where À is the physical coupling constant. 
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We will now separate the divergent and finite parts in the unrenormalized 
vertex function of (2.31) by making a Taylor series expansion around the 
symmetric point given in (2.32) 


Ts, t, u) = —idp + 30 (59) + P(s) + Ft) + Fu) (2.34) 
where Î(s) = I'(s) — T (so) is finite and has the property 
Î (so) = 0. (2.35) 
One defines the vertex renormalization constant Z, by 
—i1Z; 2o = —idy + 31 (50). (2.36) 
Eqn (2.34) becomes 
Ps, t, u) = —iZ3 A, + F(s) + Ft) + Fu) (2.37) 
which at the symmetric point gives 
T(59, to, Up) = —iZ3 ‘Ao. (2.38) 


From the relation between the unrenormalized and the renormalized 1PI 
Green’s functions eqn (2.30), we have 


Ps, t, u) = ZG G(s, t, u). (2.39) 


Then using eqns (2.33), (2.38), and (2.39), we see that the renormalized 
(physical) coupling constant A defined in (2.33) is related to the un- 
renormalized coupling constant A, by 


A= Z2Zz "dp. (2.40) 


It is now easy to demonstrate the finiteness of the renormalized 1PI four- 
point function. From eqns (2.37), (2.39), and (2.40), one has 


PRP, -e Pa) = Zr (pi, -++> Pa) 

—iZ,'Z3Ay + Z3[T(s) + To) + FW] 

—id + Z2(T(s) + F(t) + Pw)]. (2.41) 

Since Z, = 1 + O(Ao), F = O(A2), and A = åo + O(A3), we write to order A? 
TO(p,,..., P4) = —iA + F(s) + F(t) + Pu) + O(A3) (2.42) 


which is completely finite. 

For the renormalization of the connected four-point Green’s function to 
one loop, we have to add the one-particle reducible one-loop diagram (Fig. 
2.6) and attach propagators for the external lines. Thus the unrenormalized 


pd + 3 other diagrams 


Fic. 2.6. 
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Green’s function Gp, ... pa) is given by 


4 
GS(p,...Pa) = I] (saz) lo + 3I (so) + F(s) + F(t) + Tu) 


j=] p? — p + ie 
E ai J} 2.43 
ae = PON pe = 18 Hie) oe 


The first and last terms in (2.43) can be combined to give 


‘ : 1 i l 
cli airi ilar] 


l = 1 


Since r ~ O(A2), Č ~ O(A2), we can also write 


4 1 i i i 
[I (saree) [Oreo +Tis)+fa+hw] 


1 1 
7 p z = Ho — ANA z) 


x [3r(so) + (9) + Ft) + Puy] + OV). (2.45) 
Using eqns (2.44) and (2.45), we can write eqn (2.43) as 
üi 1 
eens i È — pu — T 
x [-i + 3T (so) + P(s) + TO) + Pw] 


= | Il ACP) | rP(pi -.. Pa) (2.46) 


j=1 


where we have used eqns (2.5) and (2.31). The renormalized four-point 
Green’s function is defined by (2.26) as 


GE(p bi p4) = Z °GY(P1 sse Pa): (2.47) 


Then from eqn (2.46) and the relations between the renormalized and the 
unrenormalized quantities (2.29) and (2.39), we get 


4 
GE(p,..- Pa) = Zp? |z; lI inne) | Ze TR(p1... Pa) 
j=1 


4 
= II [iAr(p)ITR(P1 -+ Pa) (2.48) 


which is also finite because Ag(p) and I'(p, ... p4) have been shown to be 
finite. 

We see that the mass, wavefunction, and vertex renormalizations remove 
all the divergences in the two- and four-point Green’s functions in the one- 
loop approximation. There is no divergence in the other 1PI diagrams 
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although the one-particle reducible graphs for the higher-point functions 
have divergent one-loop graphs. For example, the six-point function in Fig. 
2.7 is divergent. However, it is clear that the divergence is brought about by 
that of the four-point vertex function and it is removed once we renormalize 
the four-point vertex function. 


Ot 


FIG. 2.7. 


In summary, Green’s function can be made finite if we express the bare 
quantities in terms of the renormalized ones through relations (2.18), (2.23), 


and (2.40) 
$ = Z5 "ġo (2.49) 
kez Zi (2.50) 
K? = uo + dp? (2.51) 


where ôu? = E(u? ). More specifically, for an n-point Green’s function when 
we express the bare mass py and bare coupling constant A, in terms of the 
renormalized mass u and coupling A, and multiply by Z% */* for each external 
field as in (2.26), then the result (the renormalized n-point Green’s function) 
is completely finite 


GL(P1, sey Pho A, LL) a Zo" GY (Pi, re) Pn; ho; Ho, A) (2.52) 


where A is the cut-off needed to define the divergent integrals. This feature, in 
which all the divergences, after rewriting 4) and py in terms of A and yp, are 
aggregated into some multiplicative constants [Z,”/* in eqn (2.52)], is called 
being multiplicatively renormalizable. Equivalently, the 1PI Green’s functions 
are made finite as in (2.30) by multiplying by Z¥? and expressing the bare 
quantities 1), Uo in terms of the physical quantities A, u, 


PR(Pi> -o Pri As U) = ZITO, -s Pas 20s Ho» A). (2-53) 


The programme of removing divergences as outlined in this section is closely 
related to the one originally developed and we shall refer to this as the 
conventional renormalization scheme. 


2.2 BPH renormalization in 4ġ4 theory 


BPH renormalization (Bogoliubov and Parasiuk 1957; Hepp 1966; 
Zimmermann 1970) is completely equivalent to conventional renormaliza- 
tion. This alternative formulation of the programme is often more 
convenient for many applications of the renormalization theory. In this 
section we shall simply illustrate the connection between these two re- 
normalization schemes. For a concise and lucid presentation see Coleman 
(1971b). 
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For the original (unrenormalized) Lagrangian (2.1) 
À 
Lo = H(6,b0)” — ups] — Fb (2.54) 


we can replace the bare quantities by renormalized quantities using eqns 
(2.49), (2.50), and (2.51) to obtain 


Lo=zaL+hML 
where 
Z = 46,0" - 2671-56 (2.55) 
and 
AL = a a (2,0)? — ug] + H zp? = A $t. (2.56) 


£, which has exactly the same form as Zo but with all the unrenormalized 
quantities replaced by renormalized ones, is called the renormalized 
Lagrangian density. AY contains the divergent renormalization constants. 
(Z, — 1), (Z, — 1), and dp? are all of order A and this makes AY of order 
AL. We call AY the counterterm Lagrangian. 

‘The BPH renormalization prescription consists of the following sequence 
of steps 


(1) One starts with the renormalized Lagrangian of eqn (2.55) to construct 
propagators and vertices. 

(2) The divergent part of the one-loop 1PI diagrams is isolated by the 
Taylor expansion. One then constructs a set of counterterms AZ which is 
designed to cancel these one-loop divergences. 

(3) A new Lagrangian Z = Z + AF" is used to generate two-loop 
diagrams and to construct the counterterm AY?) which cancels the 
divergences up to this order and so on, as this sequence of operations is 
iteratively applied. 


The resulting Lagrangian is of the form 
PO aL tne 
where the counterterm Lagrangian AY is given by 
AZ SAZU FAZA Fiss FALT Pre (2.57) 


In order to show that this renormalization scheme is equivalent to the 
conventional one which develops the unrenormalized perturbation theory 
directly we need to show that the counterterm Lagrangian (2.57) has the 
same structure as that of eqn (2.56). To demonstrate this we shall use the 
power-counting method to study the counterterms. 


Power-counting method 


To analyse the divergent structure of any Feynman diagram we introduce the 
term superficial degree of divergence D, which is the number of loop momenta 
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in the numerator minus the number of loop momenta in the denominator. 
For example the graph shown in Fig. 2.8 has D = 4 — 4 = 0. Hence it is 
expected to be logarithmically divergent. To calculate D for any graph in the 
Ag* theory we define the following numbers. 


B = number of external lines; 
IB = number of internal lines; 
n = number of vertices. 


Since each vertex has four lines and both ends of an internal line must 


Py l P3 
p2 l, P4 
FIG. 2.8. 


terminate on vertices while only one end of an external line is connected to a 
vertex, we have the relation 


4n = 2(IB) + B. (2.58) 


We need to convert some of these to the number of loop momenta. The usual 
Feynman rule requires us to integrate over internal momenta which are not 
fixed by momentum conservation at each vertex. Thus we expect the number 
of loop momenta (L) to be the number of internal lines (ZB) minus the 
number of vertices (n). But one of the combinations of momentum 
conservation ĝ-functions just expresses the overall momentum conservation 
and it does not depend on the internal momenta. For example the graph in 
Fig. 2.8 has two vertices and hence two 6-functions: 6*(p, + pa — L — L) 
6*(1, +1, — p3 — p4). But this can be written as 6*(p, + p2 — P3 — Pa) 
6*(p, + pa — L — L). The two vertices eliminate only one, rather than two, 
internal momenta. Therefore, we have 


LIB =n: (2.59) 


For each internal line the propagator contributes two powers of loop 
momenta in the denominator and each loop integration contributes four 
powers of loop momenta in the numerator. For 1¢* theory the vertices do 
not contribute any momentum factors and the superficial degree of 
divergence is given by 


D = 4L — 2(IB). (2.60) 

We can eliminate L and IB in favour of B and n by using eqns (2.59) and 
(2.58), 

D =4-— B. (2.61) 


Since Ad* theory has reflection symmetry ¢ > —@, B must be an even 
number and eqn (2.61) implies that only the two-point function (B = 2) and 
four-point function (B = 4) are superficially divergent. 
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From this power counting, which is valid to all orders of perturbation 
theory, we can now study the structure of the counterterms. For the two- 
point function we have, according to (2.61), D = 2. Being quadratically 
divergent, the necessary Taylor expansion is taken to be 


X(p?) = £(0) + p*z'(0) + X(p?) 


where X(0) and £’(0) are divergent while =(p7) is finite. There is no term 
linear in p, as X(p’) is a Lorentz scalar. We need to add two counterterms 
5X(0)7 + 42'(0)(0,@)* to cancel the divergences. They correspond to the 
Feynman-rule vertices shown in Fig. 2.9(a), (b). The four-point function has 
D = 0 and the Taylor expansion 


rop) = T(0) + F(p,) 


where I'(0) is a logarithmically divergent term which is to be cancelled by a 
counterterm of the form (iI"(0)/4!)6*. This has the graphic representation 
shown in Fig. 2.9(c). 


i È (0) i Z'(0) p? 
> @>— A —Tr‘®(0) 


(a) (b) (c) 
FIG. 2.9. Feynman-rule vertices corresponding to the counterterm Lagrangian (2.11). 


The general counterterm Lagrangian is then of the form 


EO) E0) ir®(0) 
n a a 


> pt (2.62) 


(ô p) + 


which is clearly the same as eqn (2.56) with the correspondences 
2'(0)=Z¿—1 
X(0) = — (Z — 1u? + ôu? = —X'(0)u? + ôu? (2.63) 
r®(0)= —id(1 — Z,). 


They are consistent with eqns (2.22), (2.51), and (2.36) as the renormalized 
coupling / here is defined at the zero momentum point, thus T‘®(0) = 3T (0). 
This demonstrates the equivalence of BPH renormalization and conven- 
tional renormalization. 


Comments on subgraph divergences 


We shall not present any proof that, to all orders in the perturbation 
theory, this renormalization programme removes all divergences in the 
Green’s functions. We merely illustrate some general features of the 
renormalization procedure for higher-order diagrams and the convergence 
properties of Feynman integrals with the following remarks. 


(1) We state without proof the following convergence theorem (Weinberg 
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1960). The general Feynman integral converges if the superficial degree of 
divergence of the graph together with the superficial degree of divergence of all 
subgraphs are negative. To be more explicit, consider a Feynman graph with n 
external lines and / loops. Put a cut-off A in the momentum integration to 
estimate the order of divergence 


A 


Tapie Py-1) = [tay ballon pazia 4) (2.64) 


0 


where Z is the product of vertices and propagators depending on p; (external 
momenta) and q; (internal momenta). Take a subset S = {q)...q,,} of the 
loop momenta {q, ... qı} and scale them to infinity (all q; > A with A > œ), 
all other momenta being fixed. Let D(S) be the superficial degree of 
divergence associated with the integration over this set, namely 

A 


| eta 2a gd 


0 


< A)fIn A} (2.65) 


where {In A} is some function of In A. Then the above theorem states that the 
integral over {q, ...q,} converges if the D(S)s for all possible choices of S are 
negative. For example the graph in Fig. 2.10 being a six-point function has 
= —2. But the integration inside the box having D = 0 is logarithmically 
divergent. Thus a successful renormalization programme must systematically 
remove all divergences including those associated with the subintegrations. 
In the BPH procedure these subdiagram divergences are in fact renormalized 
by low-order counterterms. For example, the graph in Fig. 2.11 with its 
counterterm vertex will cancel the subgraph divergence of Fig. 2.10. 


Fic. 2.10. FIG. 2.11. 


(2) There is another aspect of the renormalization programme related to 
these graphs with divergent subintegrations: not all divergences in a multi- 
loop diagram can be removed by subtracting out the first few terms in the 
Taylor expansion around the external momenta. This can be illustrated by 
the following example. Consider the two-loop graph of Fig. 2.12(a) which 
has the Feynman integral 


Tp) oc ALT (py? (2.66) 
where 


l l l 
I(p) = = | dtl = — 2.67 
(p) al P — p +ie (—p)? — p +i en!) 


with p =p, +p,. With each of the I(p) factors being logarithmically 
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divergent, I'\*) cannot be made convergent no matter how many derivatives 
operate on it, even though the overall superficial degree of divergence is zero. 
However we have the lower-order counterterm — 4°T (0) corresponding to 
the subtraction introduced at the one-loop level. This generates the 
additional A? contributions of Fig. 2.12(b), (c) with TP? oc —A°T(p)r (0) and 
r® œ —A7T(0)T(p), respectively. 


P) l; l) P3 


P2 \-p hp P4 


(a) 
(b) (c) 


Fic. 2.12. s-channel A? four-point functions. The black spots represent the counterterm 
—A°T(0). 


Adding the three graphs, Fig. 2.12(a), (b), (c), we have 
Pp) = TOY +TP + TO 
= —A*[T(0)]* + AT (p) — r0]? (2.68) 
= TO) + Fp). 


Only the first term on the right-hand side is divergent and can be removed by 
a A? counterterm of the form iI (0)o*/4!. We see how, with the inclusion of the 
lower-order counterterms, divergences take on the form of polynomials in the 
external momenta. Thus for diagrams with more than one loop it is useful to 
characterize a divergent contribution as being primitively divergent or not. A 
primitively divergent graph has a non-negative overall superficial degree of 
divergence but is convergent for all subintegrations. Thus, they are diagrams in 
which the only divergence is caused by all of the loop momenta growing large 
together. In general only primitively divergent graphs suchas Fig. 2.13 can have 
their divergences isolated by direct Taylor-series expansion. For other cases, 
diagrams with lower-order counterterm insertions must be included in order to 
aggregate the divergences into the form of polynomials in the external 
momenta. 


Fic. 2.13. A primitively divergent four-point function. 


(3) In the above example of a two-loop, four-point function we have seen 
how the overall divergence can be isolated when diagrams with lower-order 
counterterms are included. For such cases where the divergent subinteg- 
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rations are disjoint this can be accomplished in a fairly direct manner. 
Similarly, it is also relatively easy for cases with nested divergences, i.e. for 
cases where one of each pair of divergent 1PI subgraphs is entirely contained 
within the other (see the example in Fig. 2.14). After the subgraph divergence 


Fic. 2.14. Nested divergences and a diagram with a lower-order counterterm which cancels the 
subintegration divergence. 


is removed by diagrams with lower-order counterterms (Fig. 2.14(b)), the 
overall divergence is then renormalized by a 4° counterterm. Thus for both 
disjoint and nested divergences the renormalization procedure is rather 
straightforward. The difficult step in the proof of the convergence (to all 
orders) involves disentangling the overlapping divergences, which are neither 
disjoint nor nested divergent 1PI diagrams. Fig. 2.2(a) is an example of 
overlapping divergence. Here it 1s difficult to see in a simple way how the 
subintegration divergences can be removed in a systematic fashion because 
they do not factorize in a simple manner. Nevertheless, this problem has been 
overcome and we refer the interested reader to the literature (Hepp 1966; 
Zimmermann 1970; Itzykson and Zuber 1980). The purpose of these 
comments is to indicate how the proof of renormalizability generally involves 
complicated graph classifications and combinatorial analysis. 


2.3 Regularization schemes 


In this section we will give detailed calculations of the various renormaliza- 
tion constants in the renormalized perturbation theory described in the 
previous sections. To make any meaningful mathematical manipulations on 
the divergent integrals we must cut off, or regularize, the momentum 
integration to make the integral finite. The divergent part will then be a 
function of the cut-off A while the finite part will be cut-off-independent in 
the limit A > œ. The cut-off procedure must be chosen in such a way that it 
maintains the Lorentz invariance and symmetry of the problem. There are 
two commonly used regularization schemes: the covariant cut-off and 
dimensional regularization. We shall illustrate them in turn. 


Covariant regularization 

In this procedure (Pauli and Villars 1949) the propagator will be modified as 
l l a; 

P- tie P—-wrie D aay Vera 


i 


(2.69) 


where A? > p? and the a;s are chosen in such a way that in the asymptotic 
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region the modified propagator will have a sufficient number of internal 
momenta in the denominator so that the integral is convergent. 

Let us start with the four-point function. The graph in Fig. 2.5(a) yields a 
contribution (2.7) 


(—iA)? f d*i i i 
= 2 = AE ERENER _———— E E S E le 2.70 
EPAR | Ope apenas Pata AS 
Clearly the replacement 


l l l E w — A? 

Paw te P-—pet+ie FaN die (PF — p? + ie)? — AZ + is) 
will be sufficient to render the integral finite. Eqn (2.70) then becomes 
—A*A? f dt] l 

2 (2n)* (1 — p)? — u? + iel? — u? + is)? — A? + ie) 
(2.71) 
We choose to make the Taylor expansion around p? = 0 (or to make 
subtraction at p? = 0), 


T(p) = 


F(p?) = T0) + (p°) (2.72) 
with 
-4A f dtl l 
O=- Ja erie N C 
—A7A? f dt l 
y dai an e a 
M(p*) = 2 lous (P — p? + ie)? — A? + ie) 


l l 
a ae - rr) 
: a? dtl 2l- p — p? 
2 J (2m) (FP — p? + ie) ((l— p) — u’ + ie) 
where in the last line we have taken the limit A — œ inside the integral 


because I°(p) is convergent. The standard method to evaluate these integrals 
is to first use the identity to combine the denominator factors 


(2.74) 


1 


l -= (1- pt | eae 1 ay z) (2.75) 


n 
Q,Az... a, (aiz +22, +... a,Z,) =] 


where the z;s are called the Feynman parameters. We can also differentiate 
with respect to a, to get 


1 
zı dz, dz,... dz, 


n 
> =n! | ————"__ §1-Sz,] (2.76 
AA... Ay í lz + a3Z3 +... AnZ_)"** ( $a) ( ) 


0 


This formula has the advantage that one less Feynman parameter is needed 
for the case where there are two identical factors in the denominator. Using 
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(2.76), we can combine the denominators in (2.74) to give 


1 


l ee eee 9 [ae (2.77) 


(P — p? + ie)? (l — p} — u? + ie A? 
where 
A = (1 — aX(ËP — u’) + a[l — p) — nw’) + ie 
= (l — ap)? — a? + ie 
with 
a? = p? — a(l — a)p’. 
Thus, 


1 
= 4] bee Se ae 
Pity = 2? | (12) da zat ia i 
(0) 


(2n)* [U — ap)? — a? + ie]? 


1 


dil Qa- Dp? 


(0) 


where we have changed the variable / to / + ap and have dropped the term 
linear in / which will vanish upon symmetric integration. It is more 
convenient to do the integration by the Wick rotation, which transforms the 
Minkowski momentum to the Euclidean momentum. First we note that 
d*/ = dl dl, dl, dl, and 


P—@+ie=2 —P —a’ +ie 
— B —[(? + a2)? — ief. 


This shows that the integral (2.78) has poles in the complex /)-plane as shown 
in Fig. 2.15. 


a 

l=—(P +a’)? +ie 
e 

Re | 


© i 0 
I= (P+a’)? ie 
Fic. 2.15. 


Using Cauchy’s theorem we then have 


pa flo) = 0 (2.79) 


C 
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where 

l 
[6 — (A? + a’)'? — ie)’ T? 
Since f(/p) > 19° as lọ > œ, the contribution from the circular part of 
contour C vanishes. Eqn. (2.79) implies that 


flo) = 


00 +100 


| dlo f(lo) = | dlo fo) - 
Thus, the integration along the real axis has been rotated to that along the 
imaginary axis. Change the variable /, = il, so that J, is real and 


+100 oe) 


| dlo flo) = i | dl, f Gil4) 
pap dl, 
~ | | @+B +B 4h +a? — ie? ee 


If we define Euclidean momentum k; = (I,,/,,13,1,) with k? =17 + 
12 + 12 + 1%, then the results in eqns (2.79) and (2.80) may be written 
d*/ 1 ee d*k l 
(2n)* (7? —a* +i (2n)* (k? + a? — ie)? 
where dtk = d/, dl, dl, d/,. Using polar coordinates in four-dimensional 
Euclidean space, we have 


(2.81) 


00 2n T 
[ate = | k? dk | do fs 6 dé | sin y dy (2.82) 
0 0 0 


and 


es d*k i 
(2n)* (k? + a* — ie)? = (k? + a? — ie)3 
o 


a cad 
= igs (k? + a? — ie)? ooo 
Using the formula for beta functions 
t”i dt — 1 Tmra-m) > 84 
(t F a?)" g (a2)"-" T(n) 4 ( . ) 
0 
we obtain 
d*k l 1 
(2.85) 


(2n)* (k? +a? — ie) 32m?(a? — ie) 
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or the vertex function in eqn (2.78) becomes 
1 
as —id? f da(l — «)(20 — 1)p? 
ee el aL (2.86) 
327 [ue — a(l — a)p* — iel 
0 


Since 0 < a < 1 we get u? — a(1 — «)p? > 0 for p? < 4p? and we can drop ie 
in the denominator. It 1s straightforward to evaluate the integral to give 


: ia 4u? —s5\t ; 
(p?) = Ms) = a 2 x ( T z) In[{(4p2 — s)* 


— (|s\)*}/{(4y? — s)? + nt for s<0 


i4? 4 -s$ f s \ š 
= Fa 2 > F ) tan (ga) f for O0<s<4u 


iA? s—4y?\? [st (s — 4y’)3 
= ———. 2 ee a a E N i 
327? ! T ( 5 i Ë + (s— oe int 


for s> 4p’. (2.87) 


With the same procedure, the divergent term I'(0) given in eqn (2.73) can be 
calculated 


1 


i14? A? a da 
aee fata has an 2.88 
DOr Gye | alu? — A?) + A? oon 
0 
For large A’, this gives 
i4? A? 
(0) x 3577 In = (2.89) 
Thus the one-loop contribution to the four-point function is 
Tii oplS, t, u) = 3T (0) + F(s) + Ñt) + Fu) (2.90) 


where the cut-off-dependent T (0) is given by eqn (2.89) and the finite F(s) is 
given by eqn (2.87). We have to add the counterterm (3iI'(0)/4!)@* to cancel 
these divergences. By (2.36) this corresponds to the renormalization constant 
3ir (0) 3A : A? 
A 32n? p? 
Having cancelled the divergences, the total four-point function up to this 
order is then given by (2.42) 
rs, t, u) = —id + P(s) + P(t) + Plu). (2.92) 


For the two-point function of eqn (2.6), corresponding to the graph in Fig. 
2.4, we have 


yn (2.91) 


—id [ d‘] i 
iE(p?) = l 
) 2 |(2n)* P —p? + ie (523) 
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This is a quadratically divergent integral and it can be regularized by 
choosing a, and a, in eqn (2.69) such that 


l a, a, 


boon Pane tae 6 
It is not difficult to see that we need 
2 2 
äs ae and ae AZ Az 
Then the modified propagator becomes 
l dı a» 
Pp +i PMA Fie P—A? tie 
(Aq — W’ A(A3 — u’) A* 


~ (P= YP = ARP = AD (P= PVP = AP 
for A, and A, both approach a large A. The regularized self-energy is 


d4] A4 
~iX(p?) = Rtas Sipe ee eas 3 
LAR) G7 (2 — p? + iel? — A? + ie) 
innt a da 
= 32n? | aA? + (1 —a)p? 
(0) 


—1iA A? 
= [a — p? n a (2.94) 


Since it is independent of the external momentum p, the Taylor expansion is 
trivial, 


2(p*) = X(0) > 


A7, 2.95 
32n? ae 
As we have mentioned before, this p-independence 1s a special property of the 
one-loop approximation in A@* theory. For a more general self-energy 
graph, X(p) will have a nontrivial dependence on p and the Taylor series 
around p? = 0 will be 


X(p?) = X(0) + p?X'(0) + X(p’) (2.96) 


where (0) and &’(0) are cut-off-dependent and (p?) is finite. And we have 
to add $X(0)” and 42’(0)(0,)* counterterms to cancel these divergences. 
To summarize, the total Lagrangian up to one loop has the form 


LY = LV 4 AG (2.97) 
where 
PO) — TOE uo?) Ku 


o 


ALY) = Oa 20)? To y (0)(0,0)”. 
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Combining terms of the same structure, we can write (2.97) as 


Z (u? — ôu’)? AZ, 
(1) _ Ze Osi Ne Se ee ee 
with 
Zo = 1 + 2'(0), 
ÀZ: ` = À + 3iI (0), 
bp? = L(0). (2.99) 


The values of these renormalization constants in the one-loop approximation 
are 


Z,=1 since 2'(0) =0, 


3A A? 
LZ, = l + 39,2 0 re 
ôu? = Ja A2. (2.100) 


If we express everything in terms of the bare quantities through eqns (2.49), 
(2.50), and (2.51), we find 


1 A 
L = 5 [êh — Hoos] — zr $6 (2.101) 


which is exactly the same as the unrenormalized Lagrangian (2.1) as it should 
be. 

Finally we comment on the convention used in making the Taylor series 
expansions (2.72) and (2.96) around p; = 0 to fix the finite part of the Green’s 
function. An equivalent way to state the same convention is to specify the 
normalization conditions of Green’s function. From (2.96), the finite part of 
the self-energy has the properties 


X(p?)|,2<0 = 0 (2.102) 
and é 
OX(p?)| 
ap? |r-0 0. (2.103) 


These properties imply that the full propagator 


1 


i 2) = a .104 
iAr( p“) p? — p? — Ê (p?) + ie (2.104) 
will satisfy the normalization conditions 
Ar '(P*)Ip2=0 = — 8? (2.105) 
and 
OAR ` 
opr ieo = |. (2.106) 


Similarly from (2.72) and thus from [(0)=0, we have from (2.92) the 
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normalization condition for the vertex function 


r0, 0,0) = —iå. (2.107) 


(Remark: Although (2.104) was originally derived with a Taylor expansion of 
X(p2) around p? = p? it also holds for the present p? = 0 expansion as a 
derivation entirely similar to eqns (2.14)—(2.22) will show.) 

In short, one can use conditions (2.105), (2.106), and (2.107) to replace the 
prescription ‘Taylor expansion around p; = 0’ to fix the finite part of Green’s 
function. 

In this connection we observe that the renormalized coupling constant 
defined by (2.107) differs from that defined by eqn (2.41) where a Taylor 
expansion has been made around the symmetric point Sọ = to = Up = 4y/3. 
It implies condition (2.33) 


TO'(so, lo, Ug) = —1iA (2.108) 


to be contrasted with (2.107). Thus, different Taylor expansions or 
subtraction points yield different definitions of the coupling constant. This 
leads to the concept of a running coupling constant (see Chapter 3). Clearly 
the physics should not depend on the choice of subtraction point which is 
purely a convention. In practice how is this apparent difference taken care 
of? Consider the two-body scattering cross-sections calculated using two 
different definitions of the coupling constant. The calculated cross-sections 
may appear to be different by an overall constant (the angular distributions 
are identical). But this is immaterial because we need to define the coupling 
constant operationally as the value of the cross-section at some kinematical 
point. Thus the difference is only apparent and the two seemingly different 
calculations really yield the same result. 


Dimensional regularization 


The basic idea of this scheme (t Hooft and Veltman 1972; Bollini and 
Giambiagi 1972; Ashmore 1972; Cicuta and Montaldi 1972) is that, since the 
ultraviolet divergences in Feynman diagrams come from the integration of 
internal momenta in four-dimensional space, the integrals can be made finite 
by lowering the dimensionalities of the space-time. Then the Feynman 
integrals can be defined as analytic functions of the space-time dimension n. 
The ultraviolet divergences will manifest themselves as singularities asn — 4. 
As before, the finite part can be obtained by subtracting out the first few 
terms in the Taylor expansion. This regularization scheme has the important 
advantage that it will not destroy any algebraic relations among Green’s 
functions that do not depend on space-time dimensions. In particular, the 
Ward identities, which are relations among Green’s functions resulting from 
the symmetries of the theory, can be maintained in this dimensional 
regularization scheme. For a review see Leibrandt (1975). 

We will illustrate this method with an example. Consider the one-loop 
four-point Green’s function in eqn (2.7) corresponding to the diagram in Fig. 
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2.5(a). It is proportional to the integral 


l l 
[= | d4/—_.___.—Y- => —;+ 2.109 

| ((—- př -g +ie P — p +ie ( ) 
which is logarithmically divergent. To define the integral in n-dimensional 
space, we take the internal momentum to have n components: 
lı = (los ls... l-1), while the external momentum has four nonvanishing 


components: p, = (Po, P1» P2, P3, 0...0). The integral in n-dimensional space 
is then defined as 
l l 

Kn) = |d" s 

(”) | d—py—ptie ? — p? +i 
which is convergent for n < 4. To define this integral for non-integer values 
of n, we first combine the denominators using Feynman parameters and 
make the Wick rotation (eqn (2.75)), 


1 


d”l 
S | ax | ((l — ap}? — a? + ie]? 


0 
1 


d”"/ 
0 


with a? = p? — a(l — a)p?. 
The integrand is now independent of the angles of the integration 
momentum, which can then be integrated out 


(2.110) 


oe) 2n n Ua 
fe = | pa a | dé, | sin 6, d0, | sin 0,d0,... 
(0) (0) (0) 


0 
m 
"an= 2 
x | sin 6,-, 46,-1 
(0) 


"1 dl (2.112) 


il 
e] N 
es a 
SŠ eS 
o ———. g 


where we have used the formula 
m+ 1 


n | 
2 
| sin” B a9 = SS. (2.113) 
m+2 


Thus eqn (2.111) may be written 


1 oO 
in’ [Pee dl 
= — | da | =: 2.114 
™ can ‘| P+ a? — iy si 
5 0 
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The dependence on n is now explicit. For complex n, the integral is well- 
defined as long as 0 < Re(n) < 4; the lower bound results from the apparent 
divergence of the integral at the /=0 limit. This infrared divergence is 
actually an artefact of our procedure as it is cancelled by the singularity in 
Tn) as n > 0. We can extend this domain of analyticity by integration by 
parts 


8 


00 


1 mid) =2 d l 

n\} [O +a? — ie]? _/n A [/? + a — ie]? 

T 2 0) B ztl O : 
(2.115) 


where we have used 
zI(z) =T(z + 1). (2.116) 


The integral is now well defined for —2 < Re(n) < 4. If we repeat this 
procedure v times, the analyticity domain is extended to —2v < Re(n) < 4 
and eventually to Re(n) > — œ. Thus the integral given in eqn (2.114) can be 
taken as an analytic function for Re(n) < 4. To see what happens as n — 4, 
we use (2.84) to evaluate the integral for n < 4, 


1 


Kn) = intr (2 = 5) | waa ane (2.117) 


0 


Using formula (2.116) 


we see that the singularity at n = 4 is a simple pole. If we now expand 
everything around n = 4 


2 
aay | eae ayer (ree ye (2.118) 
2 4—n 


a *=14+(n-4)Ina+..., (2.119) 


where A and B are some constants, we obtain the limit 
1 
dint. 2 2 . 2 
I(n) > z; i" da ln[u" — a(l —a)p*]+in*A. (2.120) 


n>4 E 
0 


With the one-loop contribution of (2.7) (T = 471/32n*), we have 


1 
2 
d — i | ae In[u* — a(1 — «)p7] vial. (2.121) 
n 
0 


i 
r(p) = ER - 


4 — 


3277 
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The Taylor expansion around p° = 0 gives 


r(p°) =r (0) + P(p”) _, (2.122) 
where 
A” 2i 
T (0) = ae a 2 j 
(0) pala ne +i4) 
iA? 
~x sy 
16n2(4 — n) ae) 
and 
1 
~ —i)? 2 _ a(1 — a)p? 
(p°) fa in| 2 La 
(0) 
1 


— iA? 5 — &)(2æ — 1)p? (2.124) 


[u — a(l — a)p7] 


where we have performed an integration by parts. Clearly the finite part is 
exactly the same as that given by the method of covariant regularization in 
eqn (2.86). Thus the finite part of Green’s function is independent of the 
regularization schemes as it should be and only depends on the subtraction 
point. The I (0) term diverges as a simple pole at n = 4 corresponding to the 
In A term (2.89) in the covariant regularization calculation. 

The one-loop self-energy (Fig. 2.4) is given by eqn (2.6) which in the 
dimensional-regularization scheme becomes 


ree ce l 
=d (p )=5 (2)* P—p? +ie 


iin? = 5) 
-p ia (2.125) 


Since, from eqn (2.116), 


t-dg 


the quadratic divergent term (2.95) has poles at n = 4 and also at n = 2. For 
n — 4 we have 


(2.126) 


iiu? / 1 
—i£(0) = = (7) (2.127) 


To compare the two regularization methods we list the results for the 
divergences in Table 2.1. Thus divergent Feynman integrals when evaluated 
in n-dimensional space appear as poles of the resulting I function at 
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n= 4, ... etc., keeping in mind that the quadratic divergence also has a pole 
at n = 2, see eqn (2.126). 


TABLE 2.1 
Covariant Dimensional 
regularization regularization 
o iA? | A? iA? 2 ) 
0) ni u? 327? \4 —n 
À Aà (2g? ) 
X(0 i 
(0) 327? 327? (; —n 


2.4 Power counting and renormalizability 


In the previous sections the renormalization procedure in Ad* theory has 
been illustrated in some detail. Here we will discuss the problem of 
renormalization for the more general types of interaction. The BPH 
renormalization procedure will be followed in this discussion. In a later part 
of this section, renormalization of composite operators will also be 
examined. 


Theories with fermion and scalar particles 


For simplicity we shall first concentrate on theories with spin-1/2 and spin-0 
particles only. For the Lagrangian density, Z = J + x; Z, where £ is 
the free Lagrangian quadratic in the fields and the Ys are the interaction 


terms (for example, Z, = IW p, ga(WW)*, gappe, Gah”, 95", ...), for 
a given graph we can define the quantities 


n; = number of ith type vertices; 

b; = number of scalar lines in the ith type vertex; 
Jf, = number of fermion lines in the ith type vertex; 
d; = number of derivatives in the ith type vertex; 
B = number of external scalar lines; 

F = number of external fermion lines; 
IB = number of internal scalar lines; 

IF = number of internal fermion lines. 


Thus for Z, = g,Wy,w õp we have b, = 1, f, =2, d, =1. From the 
structure of the graph we have relations like that of (2.58) 


B + 2(1B) = J nb; (2.129a) 
F + 2(IF) =) n; fi. (2.129b) 
Just as in (2.59), the number of loop integrations L can be calculated 


L = (IB) + (IF)—-n + 1 (2.130) 
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where 


n=n,. (2.131) 


The superficial degree of divergence D is then given by 


D = 4L — 2(IB) — IF) + ¥ nid; 
= 4 + 2(IB) + 3(IF) + > n,(d; — 4). (2.132) 
Using (2.129) we can eliminate IB and IF, 


where 


is called the index of divergence of the interaction Z,;. For Ad* theory, 6 = 0 
and (2.133) reduces to (2.61). In general ô; can be related to the dimension of 
the coupling constant in units of mass. Knowing that the Lagrangian density 
has dimension four and that the scalar field, the fermion field, and the 
derivative have dimensions 1, 3/2, and 1, respectively, the dimension of the 
coupling constant is given by 


dim(g;) = 4 — b; — èf — d, = —6,. (2.135) 


From (2.133) we see that, for a fixed number of external lines, the superficial 
degree of divergence will have different behaviour for the following three 
cases. 


(1) g; has positive dimension (or ô; < 0). Then D decreases with the number 
of ith type vertices. In this case Z; is called a super-renormalizable interaction 
and the divergences are restricted to a finite number of graphs. For example, 
consider the graphs for the two-point Green’s functions in the super- 
renormalizable 4¢ġ? theory. The one-loop diagram in Fig. 2.16(a) is divergent 
while the two-loop one in Fig. 2.16(b) is not. 


n s a e 


(a) (b) 
Fic. 2.16. 


(2) g; is dimensionless (or ô; = 0). Here D is independent of the number of 
ith type vertices. The divergences are present in all higher-order diagrams of 
a finite number of Green’s functions. Z; = gibt, g,Ww¢ are such examples, 
and they are called renormalizable interactions. 

(3) g; has negative dimension (or 6; > 0). In this case, D increases with the 
number of ith type vertices and all Green’s functions are divergent for 
sufficiently large n;. These types of interactions are non-renormalizable, and 


are exemplified by Z; = g,Wy, 0"¢, g.(WW)’, 930°, ... etc. 
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The index of divergence ô; is also related to the canonical dimension of the 
field operator. The latter is defined in terms of the high-energy behaviour of 
the free-field propagator, which is clearly relevant for power counting. Write 
the propagator for the free-field operator as 


D,(p?) = | d4x e7”: *<0]T(A(x)A(0))|0>. (2.136) 


If the asymptotic behaviour is of form 
Dap > (p*) n, (2.137) 
p2> œ 
then the canonical dimension for the field operator is defined as 
d(A) = (4 — @,)/2. (2.138) 
Thus for the scalar and fermion fields and their derivatives, we have 
Apy=1, dor"p)=1+n, 
dy) = 3, d(o"w) =F +n. (2.139) 


For composite operators that are polynomials in the fields the canonical 
dimension is the algebraic sum of the constituent fields: for example, 
dp’) = 2d(¢) = 2, dw) = 2d) + dQ) = 4. In the case of theories with 
fermions and scalars only, the canonical dimension of an operator is the 
same as that of the naive dimension in units of mass. But as we shall see later, 
these dimensions are different for massive vector fields. With these definitions 
and those in (2.128), the canonical dimension for each term in the interaction 
Lagrangian density becomes 


With the index of divergence ô; = d(¥,) — 4, we see that a dimension-four 
term corresponds to a renormalizable interaction, that less than four is super 
renormalizable, and that greater than four 1s nonrenormalizable. 


Counterterms 


Since the counterterms are constructed to cancel the divergences in the n- 
point Green’s function, their structures are closely related to that of the 
superficially divergent Green’s function. For example, we have seen that in 
Apt theory to cancel the quadratically divergent parts in the two-point 
function, we need counterterms (0,6)(0"@) with dimension 4 and ¢7 terms 
with dimension 2, while the logarithmically divergent four-point function 
needs the dimension-4 counterterm ¢*. In general we have to add counter- 
terms to cancel all divergences in Green’s functions with superficial degrees 
of divergence D > 0 as determined by (2.133). For convenience we will use 
the Taylor expansion around zero external momenta p; = 0 to isolate the 
divergent terms. The structure of the counterterms depends on the number of 
divergent terms in the Taylor expansion. For example, if a Green’s function 
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is quadratically divergent, the first three terms in the expansion will be 
divergent 


l(p,) =a + bY Pin + chip i, Diy + P"(p,). (2.141) 


The counterterms designed to cancel the a-term will have no derivative, the 
terms designed to cancel the b-term will have one derivative, etc. In the 
notation of (2.128) the counterterm will have the form Oa = (0,)*(W)"(o)" 
with «= 0, 1,..., D. For Ad* theory, for example, we have terms cor- 
responding to F=0, B=2 with «=0 and 2, B=4 with «=0. The 
canonical dimension of O,, is given by 


dı =3F + B+a. (2.142) 


The index of divergence of the counterterm can then be written through 
(2.133) as 


Det = da —4 
= (x —D) +) n; ôi. (2.143) 


Since « < D, we have the result 


Ôa < N; 6;. (2.144) 


Thus, the counterterms induced by a Feynman diagram have indices of 
divergence 6,, less or equal to the sum of the indices of divergence of all 
interactions ô; in the diagram. 

The renormalizable interactions which have ô; = 0 will generate counter- 
terms with 6, <0. If all the 6;<0 terms are present in the original 
Lagrangian, so that here the counterterms have the same structure as the 
terms in the original Lagrangian, they may be considered as redefining 
parameters like masses and coupling constants in the theory. These 
renormalized parameters are inputs of the theory and we need measurements 
of some physical processes to determine them. With these inputs, we can then 
predict the outcome of all other physical processes. For example, in 1¢* 
theory we have two free parameters, the coupling constant À and mass u. We 
can use the two-particle elastic scattering cross-section at two different 
scattering angles to determine the values of À and u. The cross-sections for all 
other angles and/or all other energies (and also all other inelastic cross- 
sections) can then be predicted. Much the same holds for super- 
renormalizable theories. On the other hand, non-renormalizable interactions 
which have ô; >0 will generate counterterms with arbitrary large 6, in 
sufficiently high orders and clearly they cannot be absorbed into the original 
Lagrangian by a redefinition of parameters. For example, the non- 
renormalizable interaction 4¢° which has ô = 2, will produce counterterms 
consisting of all even powers of ¢ and their derivatives: 7" and 67"2" with 
n,m = 1, 2,..., 00. We need an infinite number of measurements to fix the 
coefficients of these terms. Thus non-renormalizable theories will not 
necessarily be infinite; however the infinite number of counterterms 
associated with a non-renormalizable interaction will make it lack in pre- 


60 Introduction to renormalization theory 2.4 


dictive power and hence be unattractive, in the framework of perturbation 
theory. 

We will adopt a more restricted definition of renormalizability. A 
Lagrangian is said to be renormalizable by power counting if all the 
counterterms induced by the renormalization procedure can be absorbed by 
redefinitions of the parameters in the Lagrangian. With this definition, the 
theory with a single-fermion interaction with a single scalar through the 
Yukawa coupling Wy.w@ is not renormalizable even though the coupling 
constant is dimensionless. This is because the one-loop diagram of Fig. 2.17 
is logarithmically divergent and we need a $* counterterm. But such a term is 
not present in the original Lagrangian. The same theory with a ¢* interaction 


Fic, 2.17. 


is renormalizable. On the other hand, if a term can be excluded on symmetry 
grounds, then the renormalizability of the theory is not disturbed because 
higher-order terms will not generate such a term. For example, in a theory 
with only one scalar field, 


A 
L = H0,4) — rg? — 2 $8 


is renormalizable because it contains a// terms with 6 < 0 (equivalently with 
dimension less than or equal to 4) which are consistent with the symmetry 
@ — —d. The ¢° counterterm will be forbidden by such a reflection 
symmetry. Also, in this context we can understand result (2.133), or the more 
restricted Ad* result (2.61). The higher-order contributions to, say, a six- 
point function should be finite in (renormalizable) 4ġ4 theory. This must be 
the case because, if they were not, one would need a ¢° counterterm to 
absorb the divergences. Such a counterterm having 6 = 2 would ruin the 
renormalizability of the theory. 


Theories with vector fields 


Since the asymptotic behaviour of free vector-field propagators is very 
different for the massless and massive cases, we will discuss them separately. 


Massless vector field. In a theory with local gauge invariance such as QED, 
the vector field is massless. The asymptotic behaviour of the free propagator 
is mild. For example the Feynman-gauge photon propagator in QED is given 
by 

-gm 
k? + 18 k> œ 


A,,(k) = O(k~?). (2.145) 
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This implies that the photon field will have unit canonical dimension: 
d(A) = 1, like that for the scalar ¢@. Consequently power counting for a 
massless vector field is the same as that for the scalar field. Theories with a 
massless vector field will be renormalizable if they contain all interactions 
with dimension less than or equal to four and consistent with local gauge 
invariance. Denoting the massless vector field by A,, we have, for example, 
the dimension-4 operators 


Wy WA", P A,A", (0,0) pA". 


This in fact represents an exhaustive listing of all possible renormalizable 
interactions (i.e. dimension-4 or -3) of spin-0 and -1/2 fields with massless 
vector fields. The only possible dimension-3 operator (0,¢)A", which is 
bilinear in fields, is part of the free Lagrangian. 


Massive vector field. Generally the free Lagrangian for a massive vector field 
V, has the form 


Lo = —7(0,V, — ôV AOV" — OV") + 4M? V, V". (2.146) 
The vector propagator in momentum space 


=g J k ,k,/M?) 


Didi) = k? — M? + ie 


(2.147) 


has the asymptotic behaviour 


D,,(k) > O(1). (2.148) 
k> æ 


This means that the canonical dimension for the vector field is two which 
differs from its (naïve) dimension by a mass unit of one. The power counting 
is now modified with the superficial degree of divergence given by 


D=4—B-—3F—2V+ 3 n(A; — 4) (2.149) 


and 
A, =b; + 3 + 20, + d, (2.150) 


where V is the number of external vector lines, v; is the number of vector fields 
in the ith type of vertex, and A; is the canonical dimension of the interaction 
term £;. To have a renormalizable interaction we need A; < 4 but, from 
(2.150), the only such term trilinear in the fields is ¢*A,, which is not 
Lorentz-invariant. There is no nontrivial interaction of the massive vector 
field which is renormalizable. However, two important exceptions to this 
statement should be noted. 


(A) In a gauge theory with spontaneous symmetry breakdown, the vector 
(gauge) boson will acquire mass in such a way as to preserve the 
renormalizability of the theory. We will discuss this in detail in Chapter 8. 

(B) A theory with a neutral massive vector boson coupled to conserved 
current is also renormalizable. Heuristically we can understand this as 
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follows. The propagator D,,(k) given in (2.147) always appears between the 
conserved currents J“(k) and J‘(k) and the k,k,/M? term will not contribute 
because of current conservation. k"J,(k) = 0 or, in the coordinate space, 
"J (x) = 0. Then power counting is essentially the same as for the massless 
vector field case. 


Renormalization of composite operators 


So far we have only considered Green’s functions involving elementary field 
operators. In many practical applications we are interested also in functions 
of composite operators, i.e. local monomials of fields and their derivatives, e.g. 
Vb, $, $ O76, ete. 

Again we will illustrate the renormalization of such composite operators in 
Agt theory. Consider the composite operator 467(x). The Green’s function 
with one insertion of 4@*(x) has the form 


GX; Xi- Xn) = COLT? Dl) -PDO (2.151) 
or, in momentum space, 


(Qn)* ôt(p + py +.. -Ph )GS(P; Pis -> Pn) 


= [ats e` Px fi dx; e“ PE GY (x; Xo sees Xn): (2.152) 
i=1 


In perturbation theory we can use Wick’s theorem to work out Green’s 
function in terms of Feynman diagrams. For example, for G(x; x4, x2) to the 
zeroth order in 4 we have 


GHP (Xs x1, X2) = OTEP (x)H(%1)H(X2))10) 


= 1A(x — x,)1A(x — x3) (2.153) 
or, in momentum space, 

G®(p; Pı, —p — P1) = iA(p,) iA(p + pi). (2.154) 

If we truncate the propagators on the external lines, we have 
UP (p; pi. -P ~ Pi) =! (2.155) 
as represented in Fig. 2.18(a). The same Green’s function to first order in 4 is 

(—iA) 
G(x; x1, X2) = [asorar — $*(y))0> 


=) 
= fas — [A(x — yI’ iA(x; — y) iA(x, — y) 


with (amputated) 1PI momentum-space Green’s function given below (see 
Fig. 2.18(b)) 


Sd i i 
T(2)(p- Be iit ee $$ 
Dy(P; Pas —P — Pi) = \ oot FPF Soe 
(2.156) 
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P; p+p, Pi ptp, 
(a) (b) 
Fic. 2.18. Zeroth- and first-order diagrams of TS? (p; pi, —p — pı). 


We see that the composite operator generates a vertex very much like a 
term in the Lagrangian except that the composite operator can carry off 
momenta. This suggests the following method of systematically calculating 
Green’s functions with composite operators. As we have seen in §1.2, we can 
generate Green’s functions of elementary fields @(x) with the insertion of 
J(x)@(x) in the Lagrangian density, J(x) being an arbitrary c-number 
function. For a composite operator Q(x) we can similarly insert y(x)Q(x) in 
the Lagrangian density where y(x) is the c-number source function 


PEY] = FEO] + 7Q. (2.157) 


Following exactly the same procedure of constructing the generating 
functional W[y], which is the vacuum-to-vacuum transition amplitude in the 
presence of this external source y(x), we obtain the connected Green’s 
functions by first differentiating In W[y] with respect to y and then setting 
the source y to zero. With Q(x) = 4¢7(x) we have the vertex shown in Fig. 
2.19(a) which may appear for example in the one-loop four-point @ function 
in Fig. 2.19(b), 


(—id)? ( q4 i i 
r (p; pi. +. Pa) = 


2 (2n)* P — p? +ie (1+ py — py? + ie 
i 


ae eee 2.158 
“Up, p) x? + ie ( 


p p 
p 
Pı I itp 3 
Pi p+p, Pa I—p\—p, P4 
(a) (b) 
Fic. 2.19. 


We are now ready to discuss the renormalization of this new set of Green’s 
functions T® (p; p,...p,). The procedure is exactly the same as that for 
Green’s functions without Q(x), [(p,...p,). Since an insertion of a 
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composite operator is like an additional vertex, a straightforward application 
of (2.133) will show that the superficial degree of divergence Do of I'S” differs 
from D of I by the index of divergence ôo of the composite operator 


Dem DPbn=D + G =o, (2.159) 


where do is the canonical dimension of Q. Thus, for dg < 4, the insertion of 
(renormalizable or superrenormalizable) composite operators will not 
worsen the convergence property of the Green’s function; the insertion of a 
dy, > 4 operator worsens the divergence of the diagram. For the case of 
Q = $7, we have d(¢*) = 2 and, for an n-point function, D,: = 2 — n. Thus 
only I is logarithmically divergent and needs to be renormalized. The 
relevant one-loop diagram shown in Fig. 2.18(b) has the Taylor expansion 


Tp; pi, —P — Pi) = TGP; 0,0) + TSR; pi, —P — pi) (2.160) 
where I} is finite and has the normalization 
T2(0;0,0) =0. (2.161) 
We can combine the counterterm 
oe 
= £30; 0, O)x(x)62Cx) 
with the original term to write 
Z ye? -Z TRO; 0, Ong? =" Zyxd? 
with 
Zy: =1+10;0, 0). (2.162) 
Thus, the total contribution to T} up to one loop is 
TSR; Pi, =P — Pi) = 1+ TSR; pis —P — pi) (2.163) 
with the normalization 
P20; 0,0) = 1, (2.164) 


In general we need to insert the counterterm AQ into the original addition of 
(2.157). 


L —> L + (Q + AQ). (2.165) 


In particular, for the counterterm proportional to the original composite 
operator itself, AQ = CQ, as is the case with 4¢ġ°, we have 


Lix] = LO] + xZ_QQ 


= LEO] + YQ% (2.166) 
with 


Such composite operators are said to be multiplicatively renormalizable. This 
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means that the Green’s function of the unrenormalized operator Qg is related 
to that of the renormalized operator Q by 


GEX; X45 Xp) = COIT Qo(x)bo(x1), --- Po(X,))|0> 
Za GS x1, -.-Xq)- (2.167) 


The composite operator Q = $d? is multiplicatively renormalizable be- 
cause it is the only operator with dimension-two. For more general cases, 
AQ Æ CQ, the renormalization of a composite operator may require 
counterterms proportional to other composite operators. In this way 
renormalization may introduce mixings among composite operators. For 
example, for Q = ¢*, the counterterms AQ = ¢’, (0,)*, and @* will be 
needed. To be definite we will restrict our illustration to the case of two 
composite operators A and B which can mix under renormalization 


Fly] = LLO] + x ,(A + AA) + XB + AB). (2.168) 
The counterterms AA and AB are some linear combinations of A and B 


AA = C444 + C43B 


AB = Cg,A + CgpB. (2.169) 
We can write #[y] as 
A 
Lix] = FLO] + Qa, c| A (2.170) 
where 
l+ C C 
Gal TA a (2.171) 
Such a matrix C can be diagonalized with a bi-unitary transformation (see 
§11.3). Thus, 
Zy 0 
UCV’ = (2.172) 
0 Zp 


where U and V are unitary matrices. The Lagrangian can then be written 


Lix] = LO] + Ziz A + ZexegB 


A \ (4 2.173 
B) \B dada) 


Con Xp) — (X1, x_)U'. 


This means that the linear combinators A’ and B’ as defined by (2.173) are 
multiplicatively renormalizable 


CO|T(A'(x)B’(y)(1) --- Pn) 
= Z3! Zp’ Zz" O|T(Ag(X)Bo()Po(%1)--- Pox). (2.174) 


where 
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An example of such simple mixing involving only two composite operators is 
the theory defined by 
nee 1 a oe ee ae = a'h? 
P = wl" ô, — mY — 5 (0,9) — 5H Q ETK — gy ~ 3), 
(2.175) 


with A = @° and B = WwW. These two composite operators can mix under 
renormalization because of the divergences in the diagrams shown in Fig. 2.20. 


FIG. 2.20. One-loop divergent diagrams involving composite operators A = $? and B = Wy. 
The dashed lines represent @-fields; solid lines w-fields. 


3 Renormalization group 


The renormalization theory discussed in the last chapter has some arbitrari- 
ness related to our choice of kinematic points in defining physical parameters 
such as the mass and the coupling constants. For example, the BPH 
renormalization prescription only requires that the divergent part of the IPI 
graph be cancelled by counterterms constructed from Taylor expansions. 
However the reference points for the expansions are arbitrary. Different 
choices of the reference points, i.e. different subtraction points, lead to 
different definitions of the physical parameters of the theory. But any choice 
is as good as any other; the physics should not depend on the choices of the 
normalization conditions. This is the renormalization group: the physical 
content of the theory should be invariant under the transformations which 
merely change the normalization conditions. This seemingly empty statement 
actually provides us with highly nontrivial constraints on the asymptotic 
behaviour of the theory. In systems with infinite degrees of freedom (such as 
quantum field theory), renormalization can be defined in such a way that it 
involves a series of redefinitions of physical parameters on the relevant 
length or energy scales. There must be relations between the physical quan- 
tities so defined. Hence the renormalization group equation expresses the 
effect of a scale change in the theory or, more accurately, expresses the con- 
nection of renormalizability to scale transformations. 

Gell-Mann and Low (1954) were the first ones to use renormalization group 
techniques to study the asymptotic behaviour of Green’s functions in 
quantum electrodynamics. The renormalization group was discovered by 
Stueckelberg and Peterman (1953); its role in the Gell-Mann—Low analysis 
was discussed by Bogoliubov and Shirkov (1959). The recent interest in the 
applications of renormalization group has largely been brought about by the 
work of Wilson (1969). Our presentation is patterned after the lecture by 
Coleman (1971a). There are a number of ways to set up the renormalization 
group equation. In §3.1 we study this in the form of the Callan—Symanzik 
equation (Callan 1970; Symanzik 1970b) which is associated with momen- 
tum subtraction schemes. In §3.2 we briefly discuss the mass-independent 
renormalization or minimal subtraction scheme (t Hooft 1973; Weinberg 
1973a) and its associated renormalization group equation. The solutions to 
these equations in the asymptotic region are found in terms of the ‘effective 
coupling constants’ which are studied in more detail in §3.3. 


3.1 Momentum subtraction schemes and the Callan—Symanzik 
equation 


As stated above the existence of a renormalization group is related to the 
freedom one has in the choice of the reference points for Taylor expansions 
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leading to different definitions of the physical parameters of the theory. 
These choices may be expressed as different normalization conditions on 
certain 1PI amplitudes. The physical parameters should then be regarded as 
dependent on the choices of normalization conditions. We shall first 
illustrate this in A@* theory by giving two specific examples of mass- 
dependent normalization conditions (or momentum-subtraction schemes). 


Intermediate renormalization 


This corresponds to a Taylor expansion around zero external momenta. For 
the self-energy we have 


X(p?) = X(0) + L'(O)p? + X(p?). (3.1) 
The finite part (p?) will have the properties 
=(0) = 0 (3.2) 
d2(p’) 
=(). . 
a (3.3) 


The full propagator Ag(p”) is related to the self-energy (p°) by 


1 
iAR(p?) = = —=—— 3.4 


and the 1PI two-point function ['Q(p’) is given by 
TR (p*) = iAg(p*)[iAr(p*)]-* 
—i[Ag(p?)]~* 
—i[p? — p’ — X(p*)]. (3.5) 


The normalization conditions on (p?) (eqns (3.2) and (3.3)) can be 
translated in terms of I'(2)(p7) as 


reo) = p? (3.6) 
rK’ (p?) 
Op? p?=0 


For the four-point function, the finite part of the higher-order contribution is 
defined by 


= —1. (3.7) 


PPCP, P2» Ps) = TO (p1, p2, p3) — TO, 0, 0). (3.8) 
Thus we have 
PR(Pi,P2,P3)=0 at py =p: =p; =9. (3.9) 
Including the tree-level contribution 
rE P2» p3) = —iA + TR(P1, P2; P3), (3.10) 


the normalization condition on the total four-point function reads 


TR?(p1, P2, D3) =-i at py=p.,=p3;=9. (3.11) 
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We note that u? in this subtraction scheme is not the physical mass and that A 
is not the physical coupling constant because the points p; = 0 are not in the 
physically allowed region. But we can express all physically measurable 
quantities in terms of these two parameters. In this sense they are physical 
parameters. 


On-shell renormalization 


This corresponds to a Taylor expansion for external momenta on the mass 
shell, i.e. p? = u’. For the self-energy, this gives 


(p?) = E(u?) + (p? — w?)X'(u?) + Èp’). (3.12) 
Thus, 
Žu) = 0 (3.13) 
ôÈ(p? 
(p ) = (3.14) 
Op p2=p2 
Or, in terms of PY)(p’) of (3.5), 
PR (u*) = 0 (3.15) 
or 2( p? 
Ox te") =—l, (3.16) 
Op p? =u? 


For the four-point function, a convenient choice of the reference point for 
the Taylor expansion will be the symmetric momentum point 


rpi P2: P3) = —iA at p? =p’, 
s = t = u = 4u?’/3 (3.17) 


where s, t, and u are the Mandelstam variables. In this case the parameters u? 
and J are the physical mass and, up to some kinematical factors, the physical 
differential cross-section at s = t = u = 4/3, respectively. 

These two examples are specific realizations of a general renormalization 
scheme where the normalization conditions R can be a function of several 
fixed ‘reference momenta’, €,, €,... such that 


PEt) = w? (3.18a) 
(2) ,2 
Aky) = —] (3.18b) 
Op p? = 83 
and 
TEC Cas Es) = —1A. (3.18c) 


Renormalization group. Consider two different renormalization procedures, 
R and R’. Since both start from the same bare Lagrangian 


L = L (R-quantities) 
= £,(R'-quantities), (3.19) 
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in terms of the unrenormalized fields (see eqn (2.23)), we must have 


Pr = Z3 P (R)po; br = Z P (Roo. (3.20) 
Thus, 
Pr = Zg P (R', Robe 
where 


Za(R', R) = Z4(R)/Z4(R). (3.21) 


This means that the renormalized fields in different subtraction schemes are 
related by a multiplicative constant. Since both @p and dz are finite, 
Z,(R’, R) must also be finite even though it is a ratio of two divergent 
quantities. Similar relations between the coupling constants, masses, and 
Green’s functions can be worked out 


An = Zi R; R)ZŽ(R', R)hr (3.22) 
ug = UR + Op?(R’, R) (3.23) 
where 
Z,(R', R) = Z,(R')/Z(R) (3.24) 
ôu’ (R', R) = du?(R’) — ôu? (R). (3.25) 


are all finite. The operation which takes the quantities in one renormalization 
scheme R to quantities in another scheme R’ can be viewed as a 
transformation from R to R’. The set of all such transformations is said to 
form the renormalization group. We now translate this renormalization group 
invariance into the analytic form. 


Callan-Symanzik equation 


First we note that differentiation of the unrenormalized Green’s function 
with respect to the bare mass is equivalent to an insertion of the composite 
operator Q, = 4¢ġ% carrying zero momentum 
or (p;) 
——— = —iT (0; p; 3.26 
ie $200; pi) (3.26) 
because I)(p,) depends on yé only through the bare propagator 


i^o(p) = : (3.27) 


p? — u + ie 
and because 
In terms of the renormalized (1PI) Green’s functions, we can write 
De (p:; A, u) = Zg T™ (pi; 20, Ho) (3.29a) 
TSR(P, Pi; A, u) = Zi ZE TSP, Pi; Ao» Ho). (3.29b) 
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After substituting (3.29) into (3.26) and using the following relation 
0 ôu? ô 0A ô 
— T(p; 4, wW) = | = ae tao SIT OO å, w), 3.30 
we have the Callan-Symanzik equation in 4¢ġ4 theory 


ô ô | 
c aa t pa m | TR (pis A, u) = — ip? ol $O, pi; 2, u) (3.31) 


where a, p, and y are dimensionless functions 


dA/O us 
= 27 32 
ô In Z,,/Oue 
2 $ 0 
= W ———_ 33 
Pema ck a Ap 34 
ôu’ /6u9 sre 


The function « is related to y: forn = 2 we have the normalization conditions 
(3.6) and (2.164) 


rR(0; 4, w) = ip? and TY,(0, 0; 4, w) = 1. (3.35) 
Hence, from eqn (3.31), 
a= 2(y — 1). (3.36) 


Since the renormalized quantities T® and '$?, are both cut-off independent 
to all orders in J, we expect that the functions a, p, and y are also cut-off 
independent. To see this explicitly we set n = 2 in (3.31) and differentiate 
with respect to p? 


0 0 0 l 0 
E ai +$ al 2 | ap? rR(p; A, p) = ~ina- TPO, p; A, y). 


Op 
(3.37) 
Set p? = 0 and use the normalization condition (3.7) 
OV} (p?; A, 
ER A e a (3.38) 
Op p?=0 
Then (3.37) turns into 
0 
y= p(l — y) È TBR (0, p*, À, W| (3.39) 
p p?=0 


This demonstrates that y is cut-off independent. Every function except f in 
(3.31) is now independent of the cut-off; hence f is also cut-off independent. 
Since, «, p, and y are dimensionless, the cut-off independence implies that 
they are functions of the dimensionless coupling constant only, i.e. « = a(A), 
B = P(A), and y = »(A). 

In practical calculations of «, B, and y it is convenient to use the cut-off (A) 
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dependence of the renormalization constants Z}, Z, as follows. In un- 
renormalized perturbation theory with unrenormalized A, and po, the 
renormalized parameters u and A, defined in (2.51) and (2.50), 


K? = uo + dp? (3.40) 

and 
A=Zdo (3.41) 

with 
Z27,°22 (3.42) 


are functions of Ap, Ho, and A. From dimensional argument, A and the Z;s 
can depend only on dimensionless quantities like 4) and A/o. If we further 
replace uo by u = L(g, Ho, A), we have À = A(Ay, A/u) and Z; = Z; (Ao, A/p). 
Using the chain rule of differentiation 


ô ôu? ô 
—sz Ao, A =— — Alia, A > 3.43 
TE (Ao, A/y) M T (Ao, A/p) ae (3.43) 
we have 
0 
B= n= Ulos Nai 
u 
a — 
=i o LZ (20, A/M)o da, ao 
u 
E 
R — ÀA OA [Z (Ao > A/H)]u, ao (3.44) 
or 
0 = 
=— 3.45 
B= -A= [In Z(2o, AV] (3.45) 
Similarly, we obtain 
= fou In Z,(Ap, A/p) | (3.46) 


This means that to calculate the Callan—Symanzik f and y functions we only 
need to know the In A term in the Z;s. At the one-loop level we have (eqn 
(2.100)) 


A 3277 
Z = 1 + 0(43). 


3A, . A? 
Z,=14+——5n—, + O(12) 
u 


Hence 
3A? à 
B(A) = T + O(å?) (3.47) 
y(A) = O(7). (3.48) 


The generalization of the Callan—Symanzik equation to Green’s functions 
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involving several composite operators A, B, C ... can be carried through in a 
straightforward manner. First we choose the appropriate linear combination 
of operators such that they are multiplicatively renormalizable (see §2.4). 


(Gaps pea Lg key GR: Ti (3.49) 
or 
T ae 27 Lp kee oy Ae ees (3.50) 


The Callan—Symanzik equation can be readily shown to be 


0 0 
c au + B Al ny + Ya. {TG}... $e = —ipa{T aB.. Yr (3.51) 
with 
0 


l 
YAB... = -AA nA In[Z,Z, ae ale 


Weinberg’s theorem on the asymptotic behaviour of Green’s function 


The large-momentum or short-distance behaviour of Green’s function is 
clearly of great interest. It is related to the renormalizability properties of the 
theory. An important theorem here is the one due to Weinberg (1960). It 
concerns this behaviour for nonexceptional values of momenta in the 
Euclidean region. In the Euclidean region all momenta are space-like, p? < 0, 
which can be realized most easily by having real space and imaginary time 
components. A momentum configuration p,,p,,...,p, is said to be 
nonexceptional if no nontrivial partial sum vanishes, p;, + Ppi, +... Di, = 0 
for ii, i2, ..., i, take on any of the labels, 1, 2,...,. (A trivial partial sum 
which vanishes would be p, + p2 +... Pa = 0 because of the overall energy— 
momentum conservation.) 

Again we state without proof Weinberg’s theorem. If the momenta are non 
exceptional and parametrized as p; = ok;, the \PI Green’s function TY grows 
in the deep Euclidean region (corresponding to o > œ with k; fixed) as o*~" 
times polynomials in ln o to any finite order in the coupling À. Similarly Tg 
grows as o*~" times polynomials in In ø. 

We note that the powers of o for TẸ and Ty, are just their superficial 
degrees of divergence (see Chapter 2), which are also their (naive) dimensions 
in unit of the mass. 

For convergent diagrams it is not difficult to understand this result 
intuitively. For a nonexceptional external momentum configuration, the 
hard momenta must flow through the internal loops and set the scale for the 
loop integration momenta as well. (For an exceptional momentum configura- 
tion this need not be true.) This explains why the same degrees of divergence 
appear in our study of the large internal momentum limit and of the large 
external momentum limit. For divergent diagrams the result stated by 
Weinberg’s theorem may not be so obvious. One would expect that the 
ultraviolet portion of the integration would be controlled by the cut-off A 
even for hard external momenta. However, the cut-off-dependent part is 
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cancelled when the necessary counterterms are included. The surviving 
leading contribution again corresponds to the portion of the loop integration 
with momentum of the same order of magnitude as the hard external 
momenta. To illustrate this remark, consider the one-loop four-point 


function in Fig. 2.5, 
d*/ 

0» a 
In the three integration regions, we have I ~ In A for! > p;T ~ In pforl ~ p; 
I ~ p~? forl « p. After the inclusion of the counterterm of Fig. 2.9(c), the ln A 
term is cancelled and replaced by some term constant in p. Thus the dominant 
asymptotic behaviour comes from the region of integration where | ~ p. This 
is why the power of c in the asymptotic behaviour is the same as the superficial 
degree of divergence. In this particular case, we have D = 4 — n = 0; we 
expect from Weinberg’s theorem the asymptotic form T = (øo?) x 
(polynomial in In a). This agrees with the estimate given above. 

We note that, in the deep Euclidean region, particles are very off their mass 
shell p? > u’. Nevertheless, as we shall see later in Chapters 7 and 10, in cases 
such as deep inelastic lepton scatterings we can still extract useful infor- 
mation with the help of the operator product expansion. 

Weinberg’s theorem tells us that Green’s function in perturbation theory 
takes on the asymptotic form 


T™(op,,4, u) > ot "[ap(In o)° + a,(Ino)tA + ...] (3.53) 
with the constants a; and b; unspecified. Thus, it leaves open the question as 
to what the power series in polynomials of In o sums up to. If this sums up to 
some power of a, say a’, then y will be called the anomalous dimension as it 
modifies the canonical behaviour o*~" to o*~"~’. Clearly, we would like to 
learn all we can about the anomalous dimension y. 


The asymptotic solution of the renormalization-group equation 


If we can ignore the inhomogeneous term on the right-hand side of eqn (3.31) 
involving mass insertions, the Callan—Symanzik equation can provide 
information on the asymptotic behaviour of Green’s function. As it relates 
quantities of different orders in the coupling (u(0/du) ~ O()), 
B(0/0A) ~ O(A), and y is of even higher order), the equation can be viewed as 
some kind of recurrence relation among the a;s and b;s of (3.53). Thus the 
asymptotic solution of the Callan—Symanzik equation should be relevant to 
the study of the true large-momentum or short-distance behaviour of Green’s 
function. In other words, this renormalization-group equation sums up all 
the leading logarithmic terms to all orders of the perturbation series. 
From Weinberg’s theorem we have TẸ >T% to any finite order of A in 
the deep Euclidean region (o — oo). If we assume that this is true even when 
the perturbation series is summed to all orders, we can then drop the right- 
hand side of the Callan-—Symanzik equation (3.31) and obtain a homo- 
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geneous differential equation 
0 -0 
c ai + PA) = — m| Ds) (pis A, u) = 0 (3.54) 


where I is the asymptotic form of IY”. Thus in the deep Euclidean region, a 
small change in the mass parameter (the u(0/ðu) term) can always be 
compensated for by an appropriate small change in the coupling (the B(0/0/) 
term) and an appropriate small rescaling of the fields (the —ny term). 
First we replace the change 1n mass parameter trivially by the correspond- 
ing change in the scale parameter. From dimensional analysis we can write 


rO (p;, A, u) = t "TR (pi /p, A) (3.55) 


where I? is dimensionless and satisfies 


0 OP Vers 
Z BEN (n) - = 
We have from (3.55) and (3.56) 
Ô 0 
u=— +o + (n—4) |T3(op;/p, A) = 0. (3.57) 
Ou Oo 


The asymptotic form of the Callan—Symanzik equation can be written 


fo 5 — play S + mld) + (2 = 4) | PPD, A 1) = 0. (3.58) 
alai OA 


To solve this equation we first remove the nonderivative terms with the 
transformation 


À 
Ti (op;, 4, u) = 047" exp| = dx [Fop å, u). (3.59) 
p(x) 
Thus, 0 
a BA) £ F™ (op, A, n) = 0 (3.60) 


For convenience, define t = In o. We need to solve 
© _ gay | Fep, A, u) = 0. (3.61) 
Ot OA cme l 


In order to do this we introduce the effective, or running, coupling constant À 
as the solution to the equation 
dA(t, A) 7 
EP = B(A) (3.62) 


with the boundary condition A(t = 0, 4) = A. To obtain another form of 
(3.62) we first integrate it with respect to t 


A(t, A) 


t= | dx/B(x), (3.63) 
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then differentiate both sides with respect to 4 
1 då 1 


pada BA) 


or 


ð a]. 
È — B(A) Z| A(t, a) = 0. (3.64) 


Thus if F depends on t and 4 through the combination of A(t, A) it will satisfy 
(3.61). T® must have the form 


A 


TP (op,;, a, u) = 047” exp pe dx [Fp A(t, à) u). (3.65) 
B(x) 


0 


We can write 


rl i a 
y(x) y(x) y(x) 
—— dx | ~ —— d —— d 
exp r | B (x) J expl n | B (x) x+n | B (x) J 


A 


= H(A) exp| -n e ax 


t 


-= H(A) “E | y(A(t’, A) ar | (3.66) 
(0) 


where 
I 


H(A) = exp | ax | 
J (x) 


t 


Topi, A, u) = 07" exp| —7 | ace 4)) ax | (A(t, 4))F™(p;, A(t, 4), u). 


0 


Thus, we have 


(3.67) 


If we set t=0 (o= 1) in eqn (3.67), we see that the combination 
H(A)F™(A) is just 1. Therefore, 


t 


Papi, A, u) = 047" exp| | ee A)) ax | Pe (pi, A(t, 2), w). (3.68) 
0 


In this form the asymptotic solution [ has a simple interpretation. The 
effect of rescaling the momenta p; in the Green’s function IY is equivalent to 
replacing the coupling constant / by the effective coupling constant 4, apart 
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from some multiplicative factors. The first factor o*~" in (3.68) is the 
canonical dimension coming from the fact that rT® has dimension 4 — n in 
units of mass. The exponential factor in (3.68) is the anomalous dimension 
term which is the result of summing up the leading logarithms in 
perturbation theory. This factor is controlled by the y-function. Thus y is 
often called the anomalous dimension (Wilson 1971). 

The result in this section may be viewed as follows. The expectation that in 
the large-momentum limit masses become negligible and theory should be 
scale-invariant is too simple. Even without physical masses the renormaliz- 
able theory still has an energy scale as we must always impose normalization 
conditions at some mass scale. Thus naive dimension analysis is generally 
inadequate and scale invariance is broken. However the dependence of the 
theory on this normalization mass scale is given by the renormalization- 
group equation which expresses the effect of a small change of scale. In 
favourable cases when the inhomogeneous term in the Callan—Symanzik 
equation may be dropped the solution indicates that the asymptotic 
behaviour displays a certain universal character with operators being 
assigned anomalous dimensions. 


3.2 The minimal subtraction scheme and its renormalization- 
group equation 


In this section we will illustrate other forms of the renormalization-group 
equation. Again let us examine the multiplicative renormalizability statement 
(3.29a) which may be written as 


DP (pi; Ao, Ho) = Zp "Tp; A, y K). (3.69) 


If we regard the bare parameters Ap, Ho, dp as independent variables, then the 
renormalized quantities are functions of these bare parameters and the 
normalization scale parameter x. In this form, the right-hand side of (3.69) 
depends on x explicitly as well as implicitly through the definitions of 4 and u. 
However the left-hand side is independent of x; we then have 


0 0 ô 
S = — — T® = 0 3.70 
with 
u On 
eye 3.71 
u ôln u 
ei z= ie 
ml 2 3 K OK ( ) 
u 1 dlnZ, 
À, — | = - k ——— 3.73 
( 3 2° aK i) 


Compared to the Callan—Symanzik equation (3.31), this renormalization- 
group equation has no inhomogeneous term to begin with. We will try to 
approach it with a procedure similar to that used in solving the asymptotic 
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form (3.54) of the Callan—Symanzik equation. But the coefficients p, y,,, and 
y are now dimensionless functions of two variables 2 and u/k which makes 
the solution difficult. However, in contrast to the momentum subtraction 
schemes discussed in §3.1, there exists a mass-independent renormalization 
procedure in which the mass dependences of these renormalization group 
equation coefficients disappear. We now give an outline of this subtraction 
scheme due to °t Hooft (1973); see also the discussion in Ramond 
(1981). 


Minimal subtraction scheme 


This renormalization procedure is particularly suitable for dimensional 
regularization. Here the divergences show up as poles when the dimension 
n —> 4. The minimal subtraction scheme consists of adding counterterms to 
cancel these poles. In other words, the counterterms have no finite parts. 

As an example, consider the one-loop self-energy (Fig. 2.4) in 1¢* theory. 
In momentum subtraction schemes of §3.1 the presence of the arbitrary mass 
scale k is obvious (e.g. as the normalization point). In the dimensional 
regularization one also needs to introduce a mass scale x to compensate for 
the naive dimensions of coupling constants and masses: A > (k)*~"A and 
u > ku. We have 


; —1AK® | d”l 1 
oi 


ee n/ 27 Ls ey 2 E 
_ IAKÊ N ( n/2) _ 1A T(-1 a ¢/2) (=) Egel 2 (3.74) 


~ 2(2n)" wor 32n? 
where 
é=4—n., (3.75) 
To make an expansion around e = 0, we use the formulae 
—1)"T] 
oa a + vn + 1) + 000) (3.76) 
a = e" —~]4+¢lna+ O(e) (3.77) 


where 


l | 
Wn+1)=1l+=2+...--y 
2 n 


and y = 0.5772 ...is the Euler constant. Thus as € —> 0, eqn (3.74) becomes 
—iAp? 
32n? 


—i£(p?°) > f + w(2) + 2 In(k/p) + 2In2/n + a | (3.78) 


Thus the counterterm X(0)@7/2 to be added, as in eqn (2.62), in the minimal 
subtraction scheme is 


AG ga oy. (3.79) 


This is to be contrasted with the counterterm (Ap?/32n7)[1/e + 3W(2) + 
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In(x/u) + In 2./2]? that one would have added in the momentum subtrac- 
tion schemes. Thus the minimal subtraction counterterm Lagrangian when 
expanded in a Laurent series in ¢ will contain only divergent terms. The 
relations between the physical 4, u, and @ and the bare parameters are 


Ag = ea + Ş ae | (3.80) 
Up = di +) baie | (3.81) 
do = of +5 cele | = Z5". (3.82) 


Thus the coefficients, reflecting the same property of the counterterms, are 
independent of the arbitrary parameter x and (since they are all dimension- 
less) the particle mass u. Hence this minimal subtraction scheme is also called 
the mass-independent renormalization procedure. One can easily understand 
this feature as the counterterms have no finite part; they just have the ‘bare- 
bone’ structure needed to cancel the infinities at very large momenta where 
the theory is not sensitive to its masses (provided the amplitude are well- 
behaved as p > œ). 

To calculate the renormalization-group parameters of eqns (3.71)-(3.73), 
we use the fact that the bare quantities are independent of x. Thus, from 
(3.80), 


OA < Léa, da 
+ (ai +0) + > ae oe + es [= 0. (3.83) 


r=1 E 
Since k(0A/Cx) is analytic at € = 0, we can write 
OA 5 
Ka = dy + de + aoe +... (3.84) 


From (3.83), it is clear that d, = 0 for r > 1 and 


d l da, da, 
s+ dh) + (a, tdo + ds St) +E | das tdo ge + A afo 


då dà 
(3.85) 
which implies that 
(A+d,)=0 
da 
a, fe di Pa = — do 
d da, 
(i + d, 3 d;+1 = — do dà (3.86) 


Thus, 


80 Renormalization group 3.2 


or 
da 
B(A) = —a, + A (3.87) 
We also have 
d d 
(1 —A 5) La,41(4) — a,(4)] = a a,(A). (3.88) 
Similarly, we have, from (3.81), 
ô In db 
Ym = K = = A (3.89) 
db... db, db, d 
A rae b,A TET (i ai) a,(A) (3.90) 
and, from (3.82), 
1 dlnZ, de, 
cane ae pens 3.91 
eo aK d on 
dèi dc, dç, d 
À qd A qi di (i 15; al) (3.92) 


Thus eqns (3.87), (3.89), and (3.91) enable us to calculate p, y,,, and y directly 
from the residues of the simple poles a,, b,, and c,. The recursion relations 
(3.88) and (3.90) are useful in computing the residues of the higher-order pole 
terms in terms of the simple pole. (This is the same reason why the leading 
logarithms, the next-to-leading logarithms, etc. can be calculated to all 
orders by using the renormalization-group equation with the computation of 
just a few low-order terms.) Here we will just make a simple check that the f- 
function result agrees with previous calculation. From eqns (2.50), (2.63), 
and (2.123) we have 


Ao = Zy°Z,A 
with 
AZ, =4A-i1 (0) =A + wes (3.93) 
a R 16n? € i 
Thus, a, being quadratic in 4, A(da,/d/) = 2a, and 
34? 
B=a,= 16,2 (3.94) 


which agrees with (3.47). 

The fact that f, y, and y,, in this subtraction scheme are functions of A only 
will simplify the solution to the renormalization-group equation (3.70). The 
procedure will be similar to the steps of eqns (3.55)—(3.68). From dimensional 
analysis, eqn (3.70) can be written as 


ô ô ô l 
jos = B(A) oA = (Ym — 1) ôu T ny(A) + (n = J rR(op;, H, À, K) =0. 
(3.95) 
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To solve (3.95) we now introduce not only an effective coupling constant 4(t), 
but also an effective mass [i(t) (with t = Ino as before) 


dit) _ 
= BO (3.96) 
di = 
AO = (8) — AO 3.97) 
with boundary conditions 
At =0)=A (3.98) 
u(t = 0) = u. (3.99) 


The solution to eqn (3.95) may be written down: just as eqn (3.68), 


t 


PR (opi, H, À, K) = 0%" "E | y(A(t')) ar | rR (Ôp: H(t), A(t), K). 


In this formulation the large-momentum limit (3.54) and the validity of the 
asymptotic solution (3.68) hinge on whether or not the effective mass A(t) 
vanishes in the deep Euclidean limit t > oo. 

We should remark that in the momentum-subtraction schemes of §3.1, one 
can also introduce the arbitrary mass scale u as the subtraction point to 
derive a homogeneous renormalization-group equation (Weinberg 1973a). 
But then the functions y, B, y,, will depend on (m/) in addition to depending 
on the coupling constant A. This will cause some difficulty in solving the 
renormalization-group equation. In practice one can get around this by 
choosing the subtraction point u large enough so that the dependence on 
(m/p) of y, P, and y,, can be neglected. 
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Apart from the trivial dimensional factor o*~", the Green’s function 
r®(op;, A, u) in the deep Euclidean region with o > œ (or e' > œ) depends 
on c only through the effective coupling constant A(t, A), which we will 
concentrate on in our study of the asymptotic behaviour of Green’s function. 
As we discussed in §3.1, the definition of the coupling constant A depends 
on the subtraction point. For example in the intermediate renormalization 
scheme, the four-point function in the 1@* theory is given by eqn (2.42), 


T((s, t, u) = —id, + F(s, 4) + F(t, 2) + Tu, Ay) (3.100) 


where Î (s, A) is given by eqn (2.86), and A, is the coupling constant defined in 
the intermediate renormalization scheme. The cross-section for two-particle 
elastic scattering is related to the four-point function by 


do _ 1 1 
dQ 64r? s 


42, (3.101) 
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But in the on-shell renormalization scheme the coupling is defined differently 
with the four-point function expanded around the symmetric point 
s = t = u = 4p’7/3. 


rÉ (s, t, u) = —ids + T(s, As) + s(t, As) + Psu, As) (3.102) 
where 


F(s, As)ls-4423 = 0 and PO(s=t =u = 42/3) = —ilg. 


Since the cross-section (3.101) should be the same in these two schemes of 
renormalization, we have 


Tis, t, u) = Ts, t, u). (3.103) 


Evaluating both sides at the symmetric point, Sọ = to = Uo = 47/3, eqn 
(3.103) implies 


—ids = ~i + Ñ(so, Ay) + Ñ(to, Ay) + Ñ (uo, 41). 


This gives the relation between coupling constants defined by different 
subtraction schemes. Clearly the subtraction point can be taken at any point 
in the physical or unphysical region. And the coupling constant in any 
renormalization scheme should be regarded as a function of the subtraction 
point. In this sense the coupling constant is energy-dependent and is called 
the effective, or the running coupling constant. 

There is another way to look at the running coupling constant. It simply 
reflects the effect of the leading radiative-correction terms. In perturbation 
theory the effective expansion parameter is actually the coupling constant 
multiplied by some logarithmical factors. Normally one picks the normaliza- 
tion point to be of the same order of magnitude as the typical momentum 
scale of the problem. The argument of the logarithm, which is typically the 
ratio of these scale factors, is then generally of order one. However, for a 
problem involving a large range of energy scale, the radiative correction 
through these large log factors can be substantial. The solution to the 
renormalization-group equation simply represents the summation of these 
logarithmic factors to all orders of perturbation theory. 

The running coupling constant 4 satisfies the differential equation (3.62) 


di(t,a) n 
a = pA (3.104) 


or more explicitly as a renormalization-group equation (3.64) 


ð ôN; 
G — p<) A(t, a) = 0. (3.105) 


Thus the change in the effective coupling À induced by the change in energy 
scale is governed by the renormalization-group f-function. To study the 
asymptotic behaviour of A let us suppose that B(A) has the form shown in Fig. 
3.1. The points 0, 4,, and 4, where f vanishes are called fixed points. If the 
coupling constant 4 is at any one of these points at t = 0, it will remain there 
for all values of momenta. Furthermore, we can distinguish two types of 
fixed points. Consider the neighbourhood of /4,. Because f(A) > 0 for 
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O0<A<jA,, the effective coupling constant À in (3.104) increases with 
increasing momenta (i.e. d//dt = B(t) > 0) and is driven to 4,, as t > œ. As 
B(A) < 0 for A, < å < å, it decreases with increasing momenta and is again 
driven to 2, . Thus in the interval 0 < 4 < 4,, the coupling constant / is always 
driven to J, for large momenta. A, is called an ultraviolet stable fixed point. By 
similar argument it is straightforward to see that in the neighbourhood of 0 
and A, the coupling will be driven to these points for small momenta, i.e. as 
t + 0. Hence the origin and 4, are examples of infrared stable fixed points. 


PCA) 


A\ Az A 


Fic. 3.1. An example of the Callan—Symanzik f-function exhibiting an ultraviolet stable fixed 
point at A, and infrared stable fixed points at 0 and 1,. The direction arrows indicate how the 
coupling constant will move for increasing momenta. 


Now we can study the asymptotic solution of the Callan—Symanzik 
equation. Suppose 0 < 4 < 4. Then 


lim A(t, A) = A, (3.106) 
t> co 
and 
(pi, At, A), a) > T2(p;, Ay, u). (3.107) 
t> oO 


For purposes of illustration let us assume that B(A) has a simple zero at A, 
and that y(A,) does not vanish; then we have in the neighbourhood of 4, 


B(A) ~ aA, — A) with a>0. (3.108) 
From 
di 
= a(A, — A), (3.109) 
we obtain 
yi Åi +(A-—A,)e™. (3.110) 


Thus for (3.108) the approach of å to J, is exponential in the variable t. In the 
same approximation, we have 
t pi 


7 _ |r) dy 
[raa À)) dx = | By) 
(0) A : 
_ = yay) fa? 
= a AA, 


A 


= WA), À — Ad, 
a | A—A, 


= (åt = yA) Ing. (3.111) 
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Thus the particular realization of (3.68) takes on the form 


lim TY(op;, A, p) = 047P COP (pi, Ar, y). (3.112) 
This means that in the deep Euclidean region, the field scales with anomalous 
dimension y(A,) and Green’s function takes on a value with å replaced by 44. 
In general it is difficult to calculate the zeros of the B-function since this 
requires results beyond perturbation theory. However f(A) has a trivial zero 
at the origin 2 = 0, where the anomalous dimension (A) also vanishes. 
Besides the practicality of calculating P(A) for small À it turns out that this 
may have particular phenomenological relevance. As we shall discuss in 
Chapter 7 deep inelastic lepton—hadron scattering probes the large- 
momentum behaviour of products of hadronic electromagnetic (or weak) 
currents. The observed phenomena of Bjorken scaling can be interpreted as 
indicating that the product of these currents has the free-field singularity 
structure. Hence, if we can find a field theory which has an ultraviolet stable 
fixed point at the origin 4 = 0, it may be taken as a candidate theory for the 
hadron constituent (quark) interactions. In other words, the Bjorken scaling 
phenomena in deep inelastic lepton—hadron scattering may be explained if 
the effective interaction among quarks vanishes in the short-distance limit. 
This suggests that a theory of quark interactions should have the feature that 
it become a free-field theory in the ultraviolet asymptotic limit (asymptotic 
freedom) and one needs to calculate the B-function and to see whether f(A) < 0 
for A 20. 
For A¢* theory, from (3.47) we see that it is not ultraviolet asymptotically 
free. More explicitly we can integrate (3.104) 


då 32? 
ap 16a? (3.113) 
to obtain 
_ A 
A= —37 (3.114) 
1677 


where A = A(t = 0, A). Of course (3.113) and (3.114) are valid only for small 
4. We have dropped higher-order terms in J. Had it been applicable for large 
couplings also, eqn (3.114) would predict that interaction strength would 
blow up at the ‘Landau singularity’ of t = 1627/32. 

The f-functions for other theories will be discussed in Chapter 10. It will be 
shown in particular that no theory without a non-Abelian gauge field can be 
asymptotically free. 

We can summarize this introduction of the renormalization group and its 
effective couplings as follows. The aim of the renormalization-group 
approach is to describe how the dynamics of a system evolves as one changes 
the scale of the phenomena being observed. Generally one is particularly 
interested in the behaviour of the system at extremely small (ultraviolet) or 
extremely large (infrared) limits of the scale. These renormalization-group 


3.3 Effective coupling constants 85 


transformations (of the effective theories at different scales) after some 
iterations often have the property of approaching a fixed point in these limits. 
The attractive feature is that the behaviour of effective theory at the fixed 
point is relatively insensitive to details of the theory at ordinary length scales 


and in some cases these fixed-point effective theories are particularly simple 
to calculate. 


4 Group theory and the quark 
model 


Ever since Einstein, symmetry has played a fundamental role in theoretical 
physics. In this chapter and the next one, we shall discuss the more familiar 
subject of global symmetry. The notion of local gauge symmetry with its 
space—time-dependent transformation will be introduced in Chapter 8. Such 
gauge symmetries can be used to generate dynamics, the gauge interactions. 
The natural mathematical language of symmetry is group theory. After the 
development of quark models and non-Abelian gauge theories of strong and 
electroweak interactions, some knowledge of Lie groups has become 
indispensable for anyone interested in the study of elementary particle 
theory. Here we shall present a practical introduction to the subject. It begins 
with a mathematical preliminary section composed mostly of definitions and 
illustrative examples. Our approach is informal. The basic notions intro- 
duced here are for group theory as applied in practice in particle physics. The 
groups SU(2) and SU(3) are studied with elementary techniques and 
supplemented with graphic methods in §4.2. The tensor method which is 
appropriate for the general SU() groups is presented in §4.3. The physical 
realization of the flavour symmetry SU(3) of strong interactions is the quark 
model which is briefly studied in §4.4. 


4.1 Elements of group theory 


A group G is a set of elements (a, b, c,...) with a multiplication law having 
the following properties. 


(i) Closure. If a and b are in G, c = ab is also in G; 
(ii) Associative. a(bc) = (ab)c; 
(iii) Identity. There exists an element e such that ea = ae = a for every a 


in G; 
(iv) Inverse. For every a in G, there exists an element a~! such that 
aa t =a ta =e. 


Also, if the multiplication is commutative—ab = ba for all a and b in G, G is 
an Abelian group. If the number of elements in G is finite, it is a finite group. A 
subgroup is a subset of G, which also forms a group. 

Here are some examples. The cyclic group of order n, Z„, consists of a, a’, 
a”, ..., a" = e (identity). It is a finite Abelian group. The symmetric group (or 
permutation group), S,,, being the set of all permutations of n objects is a finite 
non-Abelian group. The unitary group, U(n), is the set of n x n unitary 
matrices: UU? = U'U = 1. It is non-Abelian for n > 1. The Abelian group 
U(1) consists of 1 x 1 unitary matrices, i.e. they are phase transformations 
e°, The group of n x n unitary matrices with a unit determinant is called the 


3 
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special unitary group, SU(n). Similarly, SO(n) is the group of nxn 
orthogonal matrices: AA’ = A'A = 1 with unit determinant. Thus SO(3) is 
just the familiar rotational group. 

Given any two groups G = {g,,g2,...} and H = {h,,hp,...}, if the gjs 
commute with the /,s, we can define a direct-product group G x H = (g;h;} 
with the multiplication law 


Jhi 'Imhn = Ik]m' hih,- (4.1) 


Examples of direct-product groups are SU(2) x U(1) (the group consists of 
elements which are direct products of SU(2) matrices and the U(1) phase 
factor) and SU(3) x SU(3) (the group consists of elements which are direct 
products of matrices of two different SU(3)s). These groups will play an 
important role in the application of group theory in particle physics (see 
Chapters 5 and 11). If we can write a group as a direct product of smaller 
groups, the study of group structure will be greatly simplified. To see whether 
this decomposition is possible, it is useful to introduce the notion of an 
invariant subgroup, which is the subgroup N such that for any element t in N 
then rtr™t is still in N for all r in G. Thus each component of a direct- 
product group is an invariant subgroup. If the group does not contain any 
non-trivial invariant subgroup, i.e. it cannot be written as a direct-product 
group, it is a simple group. SU(n) is such an example, but U(n) is not because 
it can be written as SU(n) x U(1). The groups which are a direct product of 
simple groups without any Abelian factors are called semi-simple groups. 

A representation is a specific realization of the multiplication of the group 
elements by matrices. Thus, it is a mapping of the abstract group elements to 
a set of matrices a —> D(a) such that, if ab = c, then D(a)D(b) = D(c), i.e. the 
group multiplications are preserved. Thus properly speaking the above 
definitions of the groups U(n) and SU(n) are given in terms of their defining 
representations. Also note that the permutation operations of S, may be 
represented by a finite number of x n matrices. If a representation D(a) can 
be put in block-diagonal form, i.e. if there exists a non-singular matrix M, 
independent of the group elements, such that 


D,(a) 0 
MD(a)M~'! = D,(a) forallainG, (4.2) 
0 l 


D(a) is called a reducible representation. It is denoted by a direct sum 
D(a) ® D,@.... If this cannot be done, D(a) is said to be irreducible. We 
can consider the matrices D(a) as linear transformations on a set of basis (or 
state) vectors. The dimension of a representation is just the dimension of the 
vector space on which it acts. The reducible representation means that a 
subset of states is never connected to other states and in irreducible 
representations all states are connected with each other through group 
transformations. 

Of particular relevance to physical applications are the Lie groups, which 
we shall first define narrowly as continuous groups (having elements labelled 


88 Group theory and the quark model 4.1 


by continuous parameters such as the Euler angles for the rotation 
group SO(3)) with representations by unitary operators. Let 
a(0) = a(6,, 02, ..., 0„,) be the group elements labelled by n continuous real 
parameters. The identity element is taken to be e = a(0). The group 
multiplication a(@)a(@) = a(€) corresponds to the mapping of the parameter 
space on to itself 


£(0,) =§ (4.3) 


which satisfies the requirements of 


f(0, ©) = £(0,0) = 6, £0, f(b, §)) = ££, p), 5), (4.4) 


and £(0, 0’) = 0 if a(@)~* is parametrized as a(6’). This is a Lie group if the 
function fin (4.3) is an analytic function (or continuously differentiable) with 
respect to its variables. Thus we can use the usual analytic methods in 
abstract group space when dealing with Lie groups. Also, since trans- 
formations in quantum mechanics are unitary operators in Hilbert space we 
are particularly interested in those Lie groups with unitary representations 


a(8) = exp{i0-X} = a(0) + 10,%, +... (4.5) 
where 
pe ie (4.6) 
' 06;\e=0 | 


are called the (infinitesimal) group generators. For unitary a(@), the X, are a 
set of linearly independent hermitian operators. For example, when a(@) is an 
element of the SO(2) group, the group of two-dimensional rotations, the 
generator is simply the Pauli matrix 


tena 4.7 
EN a o (4.7) 


Define the commutator of two group elements a(ġ) and a(0), lying near 
the identity, as a()a(®)a(h)~ 'a(®)~*. This product should also be a group 
element, call it a(&). € must be a function of @ and 9, 


ci = 9i(0,) with g(0, o) = g(0, 0) =0. (4.8) 
For small 8 and @ we can expand g;(0, @) in powers of 0; and @;, 
é! = A! + BLO, + Bild, + Cibo + C100, + Cbd, t... 
The boundary conditions in (4.8) imply that 
A = Bi = B= Ch, = Ch = 0 


J 
or 
= C9 Ds Teit (4.9) 
When we equate 


al) =e +1¢X, +... (4.10a) 
to 
a(þ)a(@)a(p) ta(0) t = e + 0p [X X] +... (4.10b) 
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we have the Lie algebra 
[ya aCe: (4.11) 


The Cis, called the structure constants of the group, are a set of real numbers 
with 


C= =Cl,. (4.12) 


For example, the generators of the rotation group in three dimensions SO(3) 
are just the angular momentum operators J,, Ja, and J,. They satisfy the 
commutation relation 


LJ, J] = i&u, (4.13) 


where €,,, is the totally antisymmetric Levi-Civita symbol with £423 = 1. 

If the D(a)s form a representation of the group, the D*(a)s form the 
complex conjugate representation, since D(a,)D(a,) = D(a,a,) implies 
D*(a,)D*(a,) = D*(a,a,). From (4.6) we have the representation matrix of 
the generators 7(X;) = 7;, 


D(a(®)) = exp{i0-T} (4.14) 
with 
[T T] = iC}. (4.15) 


Clearly the — 77s also form a representation of the generators. If T, and 
— T¥ are equivalent, i.e. if there exists a nonsingular matrix S such that 


ST,;S~* = —T* for all, (4.16) 


then the T; is called a real representation. As we shall see below in §4.2, all 
irreducible representations of SU(2) are real; some properties of real 
representations will also be discussed in §4.2. 

From the Jacobi identity 


LX}, [X M1] + Lh, LX), MI + DG, X, X] = 0 (4.17) 
and (4.11), we have the relation among structure constants 
Cin + Chj Ckm + CCm = Q. (4.18) 
We can define a set of matrices 
Ch = I(T) (4.19) 


which satisfies the commutation relation of (4.15). Thus the structure 
constants also generate a representation of the algebra, the adjoint represen- 
tation. It has dimension equal to the number of real parameters necessary to 
specify a group element. 

For the semi-simple group (i.e. one having no U(1) invariant subgroup) a 
normalization convention of the 7js that is compatible with the nonlinear 
commutation relation (4.15) 1s 


tr(T;T;) = 26; (4.20) 


because tr(7;T;) is a real symmetric matrix and can be diagonalized by 
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taking an appropriately chosen real linear combination of the generators. The 
diagonal coefficients have been set to a constant J. With this basis in the 
vector space of the generators, the structure constants may be written 


jk = > tr(T,,.L7;, Ti.) (4.21) 


which implies that C% is totally antisymmetric in all three indices. 

Because the representation matrices of the group elements and their 
generators are related by exponentiation (4.14), many of their properties can 
be directly translated into one another. Trivially, they have the same 
dimension, etc. In the following, unless the ambiguity makes a difference, the 
term ‘representation’ will mean either that of the group elements or their 
generators. Also, the set of basis states of the representation is sometimes 
referred to, for brevity, as the representation. 


4.2 SU(2) and SU(3) 


The special unitary groups SU(n) are encountered repeatedly in particle 
physics theories. It is SU(2) in isospin invariance; SU(3) in ‘the eightfold 
way’; the standard gauge model of strong and electroweak interactions uses 
SU(3) x SU(2) x U(1); the simplest grand unification group is SU(5). In this 
section we shall concentrate on groups SU(2) and SU(3). The subject of the 
tensor method in SU(n) is presented in §4.3. 

SU(n) is the group of n x n unitary matrices with unit determinant: 
UU = UU' = 1 and det U = 1. Any unitary matrix U can be written in 
terms of a hermitian matrix H as U = e”. From the identity det(e*) = e'"4 
and det U = 1, it follows that tr H = 0. Since there are n? — 1 traceless 
hermitian n x n matrices, an element of SU(n) can be written as 


n2—1 
U = exp ` slat (4.22) 
a=1 


where the ¢,S are (real) group parameters. The J,s are group generators 
represented by traceless hermitian matrices. Only n — 1 of n? — 1 generators 
are diagonal. We say SU(n) is a group of rank n — 1. 


The SU(2) group 


There are three group parameters. We write the 2 x 2 unitary unimodular 
matrices as 


UE, 9 E2, E3) a exp{ig,o, } (4.23) 


where the a, are 2 x 2 traceless hermitian matrices. We choose the basis to 
be the standard Pauli matrices. 


4.2 SU(2) and SU(3) 91 


The generators defined by J; = o;/2 will give the commutation relation 
[J Ji] = lEaned (4.24) 


where £a 1S the totally antisymmetric Levi—Civita symbol and ¢,,3 = 1. We 
then abstract this as the general Lie algebra of SU(2) and all representations 
of the generators satisfy this set of commutation relations. 


SU(2) representations. The algebra (4.24) is the same as that in (4.13). We 
say SU(2) is isomorphic to the rotation group SO(3). The standard method 
of setting up angular momentum eigenstates will be followed here to get all 
the irreducible representations of SU(2). 

First define 


P=} ++ (4.25) 


which is an invariant operator, a Casimir operator, commuting with all the 
generators of the group 


[J?, J,] = 0, ä= 1,2,3. (4.26) 
Also define the raising and lowering operators 
Jad tih (4.27) 
then 
J? = 4J, J- + J-J}) + J2. (4.28) 
We have from (4.24) 
CJugdo t= 2d, (4.29) 
[J h] = FJ. (4.30) 


Consider an eigenstate of J? and J; with eigenvalues 4 and m 
J?|A, mò = Ala, mò 
J3|A,m> = mld, m}. (4.31) 


Because of (4.30) the states J,|A,m)> are also eigenstates of J, with 
eigenvalues m + 1, and, because of (4.26), the same eigenvalue 4 


J,\4, my = C,(4, m)|A, m + 1) (4.32) 


where the C',(A, m)s are constants to be determined later. For a given A, 
values of m are bounded 


A—m?>0 (4.33) 
because J? — J4 = Ji + J} > 0. Let j be the largest value of m 

Tapn: (4.34) 
Eqns (4.34), (4.28), and (4.29) then imply 


0 = J-J; |4, j> 
= (J? — J3 — Aaj =A-P DA Y (4.35) 
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or 

A=jUt+1). (4.36) 
Similarly, let 7’ be the smallest value of m 

J_\A,j'> =0. (4.37) 
We obtain 

A=) GG = 1). (4.38) 


Thus j(j + 1)=j'(j'— 1) which has the solutions j’ = —j and j’ =j +1. 
Since second solution violates the assumption that j is the largest value of m, 
we have 


j = j. (4.39) 


Since J_ lowers the value of m by one unit, j — j’ = 2j must be an interger. 
This means that j can be either an integer or half-integer. To determine 
C(å, m) in (4.32) we use 


<A, mJ- J+A, m> = |C; (4, m)’. (4.40) 


because J. = Jt, implies that <4, m| J- = C%(A,m)<A,m-+ 1|. We also 
have, from (4.35) 


<A, mJ _J4|A, m> = <A, m|(J? = J3 — J3)là, m> 


=j(j+1)- m — m. (4.41) 
Hence, 
C,(à,m) = [(j— mj + m + 1). (4.42) 
Similarly, 
C_(A,m)=[G+m)j —m+4+1)]'”. (4.43) 


These states |j, mò with m=j,j—1,..., —j form the basis of an SU(2) 
irreducible representation, characterized by j which is either an integer or 
half-integer. Thus the dimension of the representation is 2j + 1. We can use 
the relations 


Jalj, m> = mlj, my 
Jlj, m = [0 F m) + m + 1)]"?j, m + 1 (4.44) 


to work out the representation matrices. 


Example 1. J =4, m= +3 


Ibo = 45.3. (4.45) 


1 0 
bp=(o) B-b=()) (4.46) 


1 0 
n=4( i): (4.47) 


If we denote 


then 
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From J 4|3,3> = 0 and J; lz, —3> = lż, 2> we have 


Also, 


O 1 0 =i 
Ji = (J, +J-)2=4 ,  Jg= (J, -J yYä=4. V 
1 0 i 0 
(4.50) 
Example 2. J = 1, m = 1,0, — 1. 
Denote 
l 0 0 
IL D=| OF IL,O=į{ 1 7 Il, —1> =f 0b (4.51) 
0 0 
Then 
1 0 0 
J,={ 0 0 0 |. (4.52) 
0 0 -1 


From J,|1,1> = 0, J}|1,0ò = /2|1,1>, and J,|1, — 1> = /2/1,0>, we 
have 


0 a2 0 
J=ļ0 0 2f (4.53) 
0 0 0 
Then 
0 0 0 
J -= { /2 0 0 (4.54) 
0 /2 0 
0 1 0 0 —i 0 
jes 1 0 1 je i 0 -i }- (4.55) 
2 ye 
0 1 0 0 i 0 


It is straightforward to check that the J;s satisfy the Lie algebra of (4.24). 


SU(2) product representations. In applications, we often need to deal with 
product representations. For example, if we have two spin 1/2 particles, we 
want to know the total spin J of the product of the two wavefunctions. In 
this simple case, the answer is J = 0 or 1. Let us study this case in terms of 
group theory. Denote the spin-up and spin-down wavefunctions of the first 
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particle by r; and r,. Similarly denote those of the second particle by s, 
and s,. Under SU(2) transformation 


r; = Ule)irji, Sk = UE) (4.56) 
where U(e) = exp{ie,J,} and J, = o,/2. Then the product will transform as 
(riS) = U (&); ,U (E€) (r ;S1) = D(E)ix, (7 j51)- (4.57) 


Generally D(e) is reducible. To see what irreducible representation it 
decomposes into, it is easier to work with the generators directly by taking 
&<«l. 


r; = (1 + ie,J,),r; = (1 + ieJ Pir; 
sp = (1 + ieJ aasi = (1 + ied Pys; (4.58) 


where J® operates only on r; and does not affect s;; J'?) operates only on s; 
and not on r;. Define the total angular momentum operator as 


J= JO 4 J, (4.59) 


We now change to the more familiar notation. Let «œ; denote the spin-up 
wavefunction of the ith particle and f; the spin-down wavefunction. There 
are four combinations of two-particle wavefunctions: «4&2, 0,65, B1%2, B,B>. 
Take the one with the largest value of J, 


Jaia) = (Ja) + a (JIE) 
= (01%). (4.60) 
Clearly it is a state with J, = 1. To find its J value, we use 


J? = (J)? ne (JY 4 2J. J 


= J) 5 (I)? + 266 (IPI 4 JOTI?) + IDI) (4.61) 
to find that 
J (4,05) = 2(a,0). (4.62) 
This means J = 1 and we can make the identification 
1, 1> = (@,a,). 


We use the lowering operator J. = J® + J? to reach all other states of the 
J = | irreducible representation 


J_(040) = (JP a) + (JOa) 


= (1&2 + &ı b2). (4.63) 
On the other hand, using eqn (4.44), we get 
J-M, 1 = v21, 0>. (4.64) 
Thus, 


l 


(xiba + Bia). (4.65) 
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Obviously, 
|1, —1> = Bi Bp. (4.66) 
The remaining independent state must be identified as 
l 
|0, 0> = V2 («1B — B122). (4.67) 


Again, we can check this assignment by applying eqn (4.61). In short, the 
two-particle wavefunctions can be organized as 


|1, 1> = X12 
l 
|1, 0> = WA (xıb2 + 122) (4.68) 
[1, ~~ 1> = BiB, 
which is symmetric under the interchange of particles 1 <> 2, and 
l 
|0, 0> = J2 (xıb2 — P12) (4.69) 


which is antisymmetric under | «2. 
More generally the product representations |j,,m,> x |j,, m,> can be 
combined into eigenstates |J, MY of total J = J@ + J” 


|J, Mò = > Chm j2ma|JM >|jym,>|j2m2>. (4.70) 
m,,M14 
The coefficients <j,m,j,m,|JM > are called the Clebsch—Gordon coefficients. 
Thus for the above case (eqns (4.68) and (4.69)) we have 
1 
GHIL D=1, GQ -$4-411 0) =<, ete. 
The procedure of working out the irreducible representations of the 
product representations can be summarized as follows. 


(1) Start with the combination of states with the largest J}. This is also the 
state with the largest total J. 

(2) Use the lowering operator J. = J“) + JØ to get to all the other states 
in the same irreducible representation. 

(3) Find the orthogonal combination to |J,,, J,,-—1> where J,, is the 
maximum value of J in the product. This should be the state |J/,,— 1, 
J,, — 1>. Then use the lowering operator to reach the other J = (J,, — 1) 
states. 

(4) Repeat these steps until J = |j, — jal. 


We can also graphically represent SU(2) representations. The group is 
rank 1, i.e. it has one diagonal generator; each irreducible representation j 
can be characterized by a straight-line segment with points on it denoting 
values of m (see Fig. 4.1). In a product representation the eigenvalues of the 
diagonal generators J®? and J¥? are additive. We can represent this addition 
graphically by repeatedly placing the centre (m = 0) of one representation, 
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say j, over on every point of the other, j,, representation (see examples in 
Fig. 4.2). As we shall see, this graphical method can be generalized to the 
rank-2 group of SU(3), where the results are less trivial and difficult to see 
without such a diagramatical aid. 


= 1 L 
=l 0 +1 
s3 l 4 3 
2 2 2 
a) 
j= p N E 


Fic. 4.1. Graphical representation of SU(2) multiplets. The raising (lowering) operator J,(J_) 
moves a state to the right (left). 


l l 1 x——* 
1 l -2 2 2 on 
7® 7 = *— Q) I= %——K 
X 
= x< = 01 @ 1 
XX 
i L : X 
7®@ 1 = X Q) ~ = ~——_ i y} 
= XX cae, il 
pag, E 
FIG. 4.2. 


The Reality property of SU(2) representations. We shall denote the repre- 
sentation matrices of the generator by 7(J,) = T,. As we already mentioned 
in §4.1, SU(2) has the property that all its representations are real, i.e. there 
is a (fixed) matrix S such that 


ST,S~' = —T*. (4.71) 
For example, in the defining representation 7, = 0,/2 we have —o* = —o,, 
—o% = c, and —o} = —o;3. The reality condition (4.71) can be satisfied 


with S = o,. In general the eigenvalues of diagonal generators change sign as 
we go from T, to — 7* because the 7,s are hermitian with real eigenvalues. 
The eigenvalues of — 7* are precisely the negatives of those for 7,. In SU(2) 
all the irreducible representations have the property that their J; eigenvalues 
occur in pairs, i.e. m= +j, +(7—1),.... This is why they are real 
representations; — 7* can be obtained from T, by changing the basis from 
|j, mò to |j, —m). For example, in the j = 1 representation, the |1, 1> and 
|l, —1> states of (4.51) are interchanged, leaving |1,0> invariant, by the 
transformation 


0 0 | 
S={0 —1 0 (4.72) 
l 0 0 


and we can easily check that (4.71) is satisfied. Clearly, it is a general property 
of any group representation that, if one of the diagonal generators does not 
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have eigenvalues in pairs of opposite signs, then that representation is not 
real. 

We state once again some of the SU(2) properties which the reader should 
keep in mind when studying the less familiar SU(3) group. 


(1) Only the representation matrix of J; is diagonal; SU(2) is a rank-1 
group. 

(2) The irreducible representation labelled by j (dimension 2j + 1) has 
basis states |j, mò 


J*\j,m> =j + Ili, m> 
Jlj, mò = mlj, m). 
(3) States with different values of m are connected through the raising and 
lowering operators 
Jlj, m =[0 F mG +m + DI’, m1). 


(4) Each irreducible representation can be pictured by a one-dimensional 
graph because of (1), with equally spaced points representing the 2j + 1 
states. The T, operator moves these points along the line. 

—T_ Ia 
er e ae 
=f aed oe J 
Products of two representations j; and j, can be obtained simply by placing 
the first representation line 27, + 1 times over the second representation line, 
with the m, = 0 centres coinciding with each state of the j,-representation. 


The SU(3) group 


There are eight group parameters. For the defining representation we write 
the 3 x 3 unitary unimodular matrices 


U(é,,..., &3) = expfie,, } a=1,...,8. (4.73) 


The 1,s are 3 x 3 traceless hermitian matrices, which may be chosen to have 
the form (Gell-Mann 1962a) 


0 1 0 0 -1 l 0 0 
0 0 0 0 0 0 0 
0 0 1 0 0 -1 
Is=10 00) 4=ļ00 0 
1 0 0 i 0 0 
0 0 0 0 0 0 1 0 
rato 01) 24,=]0 0 ai ds = 0 1 0f 474) 
0 1 0 0 i 0 0 0 — 
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They have the normalization 


tr(A,,) = 20 ab (4.75) 
and satisfy the commutation relation 
Àa A, : A, 
E a = İfabe > (4.76) 


Save 1S totally antisymmetric with nonvanishing members, 


hi23 me l, fia7 = 1/2, fise a — 1/2, J246 E 1/2, frs7 = 1/2, 


345 7 1/2, he7 FN 1/2, Jass = V3/2, feng = V3/2 (4.77) 
The generators F, of SU(3) satisfy the Lie algebra 
Ls F, | =] abe ee (4.78) 


We can follow the pattern of the SU(2) procedure to obtain irreducible 
representations of SU(3). Here we follow the presentation of Gasiorowicz 
(1966). 

SU(3) is a rank-2 group; since A; and A, are both diagonal, 


[F,, Fg] = (Q. (4.79) 
F and F; can be diagonalized simultaneously. We define the following raising 
and lowering operators 
lis Peak Ye U, = kā tik, Vi = Fy ik. 


We also define 
2 
= — Fy. 
J3* 
In terms of these operators, the communication relations of (4.78) can be 
written as 


T; = Fy, Y (4.80) 


[73,7,]= +T} LY, T,]=0 
[T;, U4] = ¥12U,  [Y,U,]= +U; 
[T3, V] = +1/2V;  [Y,V;]= +V; (4.81) 
[T,, T_] = 27; 
[U,,U_] =3/2Y — T; =2U; 
[V., V] = 3/2Y + T; = 2V; (4.82) 
[T+, V+] = [T+, V-] = [LU+, V+] = 0 
Lrs Vals =U [T+, U4] = V} 
[U,,V_]=T.. [T;, Y]=0. (4.83) 


SU(3) representations. Since T, and Y can be diagonalized simultaneously, 
the states in an SU(3) irreducible representation must be labelled by two 
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eigenvalues: t, and y. A representation is then pictured as a two-dimensional 
figure on the t,-y plane, just as an SU(2) representation is an one-dimen- 
sional line (Fig. 4.2). From the commutation relations in eqns (4.81) 
(4.83), it is not difficult to see the results of raising and lowering operators 
acting on the states: 


T, raises t, by 1 unit and leaves y unchanged; 
U, lowers t, by 1/2 unit and raises y by 1 unit; 
V, raises t, by 1/2 unit and raises y by 1 unit, etc. (4.84) 


If the units of t, and y are appropriately scaled in the graph, these raising and 
lowering operators connect points along lines that are multiples of 60° with 
each other (Fig. 4.3). 


Fic. 4.3. 


Each irreducible representation of SU(3) is characterized by a set of two 
integers (p,q). Graphically it shows up as a figure with a hexagonal 
boundary on the ¢,—y plane: three sides having p units of length and the other 
three sides having q units (see Fig. 4.4(a)); the hexagon collapses into a 
equilateral triangle when either p or q vanishes (Fig. 4.4(b)). The boundary is 
symmetric under reflections in the y-axis. We recall that an SU(2) irreducible 
representation is characterized by one integer j; graphically it is a straight line 
of 27 units of length. There are 27 + 1 sites, each of them singly occupied by 
one state. For the SU(3) representation (p, q) the multiplicity of states on 
each site in the ¢,—-y plane form the following pattern: the sites in the 
boundary are singly occupied, on the next layer they are doubly occupied, on 
the third layer triply occupied, etc., until a triangle layer is reached beyond 


(b) 
Fic. 4.4. Boundaries of the SU(3) representation (p, q), (p, 0), and (0, q). 


100 Group theory and the quark model 4.2 


which the multiplicity ceases to increase and remains q + 1 for p >q (or 
p+ 1 for q> p). 

The procedures used to deduce these properties of the irreducible 
representation from the commutation relations are all similar. We shall 
present one such proof to illustrate the general algebraic technique. To show 
that the boundary layer is singly occupied, take two neighbouring states |A» 
and |B» on the boundary shown in Fig. 4.5. Thus 


U_|A»> = |B}. (4.85) 


We need to show that, given |A), the state |B» is unique regardless of the path 
taken to go from |A> to |B). Consider an alternative path ACB; we have 


V_T,|A> = (LV_, T, ] + T} V_)|A> = U_|A) = |B) (4.86) 
where we have used V_|A»> = 0 and (4.83). 


Fic. 4.5. A typical representation with (p, q) = (5, 1). Multiplicity of states at each site is 
indicated by the crosses. Wa, is the t = t3 = (p + q)/2 = 3 state. 


max 


It is not difficult to convince oneself that the result holds independently of 
the path taken to go from |A> to |B); hence, given |4), the state |B» is 
unique. Since the state of maximum eigenvalue of 73 is unique and resides on 
the boundary, all boundary sites are singly occupied. 

Once the multiplicity of states at each site is given, we can add them up. 
This sum is the dimension of the irreducible representation. To do this we 
start with counting the number of sites in the inner triangle which has sites 


p—q 
p—q+1 | 
5 [=n pah Dag): (4.87) 
l=1 


Here the multiplicity is (q + 1). On the next outer layer there are 
3(p — q + 2) sites with multiplicity q; on the next one, 3(p — q + 4) sites 
each with (q — 1) states, etc. Thus the dimension is equal to 


q 


| 
59+ Wp -q+ Wp -q + 2) + 3 3(q —v)\(p —q + 2v +2) 
g (4.88) 
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or 
Ap, 9g) =(p + Ig + I(p + q + 2)/2. (4.89) 


Instead of labelling an irreducible representation by (p,q), another 
common practice is to denote it by its dimensionality. Thus an m- 
dimensional irreducible representation is labelled by m and its complex 
conjugate by m*. Some of the more important representations are shown in 


(p,q)=(0,1), 3* (triplet) (p,q)=(1,0), 3 (triplet) 


2 yY 


(p.q)=(1,1), 8(octet) (p.q)=(3.0), 10(decuplet ) 


Fic. 4.6. Examples of SU(3) representations with states labelled by (t;, y). Here all sites are 
singly occupied except the centre of 8: one is a t = 0 state; another is the t, = 0 member of a 
t = | triplet. 


One more remark about the graphical representation (p, q). Since there are 
generally several states for a given value of (t3, y), at a given site we need 
further labelling to distinguish the different states. For this we can specify the 
SU(2) subgroup to which they belong. A convenient choice will be the 7-spin 
value ¢. There are p+ 1 sites each singly occupied on the top line, 
corresponding to t= p/2. The next line has two T-spin multiplets: 
ti = (p + 1)/2 and t, = (p — 1)/2. etc. Also since the widest portion of the 
hexagon has width (p + q) we conclude that 


max = (p + q)/2. (4.90) 


For the product representation we can follow a procedure similar to that 
for the SU(2) group. The method of using the raising and lowering operators 
gives not only the decomposition of the product representations but also the 
Clebsch—Gordon coefficients. But this method is rather tedious as there are 
quite a few raising and lowering operators in SU(3). If we are interested only 
in the decomposition, we can use the simple graphical method. Again we will 
place one representation figure on top of each member state of the second 
representation: the centre (t, = y = 0) of the first one coinciding with the site 
of each state of the second representation. The simplest case of 
3 x 3* = 8 + 1 is illustrated in Fig. 4.7. The more systematic approach of 
the tensor method will be presented in the next section. 
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Fic. 4.7. 


4.3 The tensor method in SU(n) 


The analysis of SU(2) and SU(3) in the last section shows that, as the 
group gets larger, the elementary techniques used to dissect the represen- 
tation structure and product become inadequate. For the SU(4) group, 
which is rank-3, the irreducible representations have to be pictured in a three- 
dimensional plot and one would need a keen spatial sense to work out the 
decomposition of the product representation. This approach becomes rather 
hopelessly complicated for groups of rank-4 or higher. Clearly one needs a 
more efficient approach. The tensor method turns out to be particularly 
appropriate for the study of irreducible representations and the de- 
composition of the product representations in the general SU(n) group. 


Transformation law of tensors 


The SU(n) group consists of n x n unitary matrices with unit determinant. 
We can regard these matrices as linear transformations in an n-dimensional 
complex vector space C,. Thus any vector W; = (W,,W.2,..., Ya) in C, is 
mapped by an SU(n) transformation U;,; as 


Wir Yi = Ui jW;. (4.91) 


The ws also belong to C,, with UU' = U'U = 1 and det U = 1. Clearly for 
any two vectors we can define a scalar product 


(V, p) = WFO; (4.92) 
which is an SU(n) invariant. The transformation law for the conjugate vector 
is given by 

Wt > Wit = Ubbt = WU}. (4.93) 


It is convenient to introduce upper and lower indices 
Wi= i, UJ =U, and Ui =U*. (4.94) 


Thus complex conjugation just changes the lower indices to upper ones, and 
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vice versa. In this notation, eqns (4.91) and (4.93) read 


Wi a Wi = U/W, 
yi > = UW, (4.95) 
The SU(n) invariant scalar product is 
M, p) = Woi. (4.96) 
and the unitarity condition becomes 
U',U*, = J, (4.97) 
where the Kronecker delta is defined as 
ee fi=j 
Ea ‘0 otherwise (529) 


Note that in this notation the summation is always over a pair of upper and 
lower indices. We call this a contraction of indices. The w;s are the basis for 
the SU(n) defining representation (also called the fundamental or vector 
representation and denoted as n), while the y's are the basis for the conjugate 
representation, n*. 

Higher-rank tensors are defined as those quantities which have the same 
transformation properties as the direct products of vectors. Thus tensors 
generally have both upper and lower indices with the transformation law 


Hae = (Ug Uz... Ug NU U eU i®. (4.99) 
They correspond to the basis for higher-dimensional representations. 
The Kronecker delta and Levi-Civita symbol are invariant tensors under 
SU(n) transformations. They play important role in the study of irreducible 
tensors. 


(1) From the unitarity condition of (4.97) we immediately have 
ôi = UU; ô. (4.100) 


Hence ĝi is an invariant tensor. Generally, contracting indices with the 
Kronecker delta will produce a tensor of low rank. For example, 
SpE = (4.101) 
We can regard the right-hand side as the trace between the pair of indices, in 
this case i, and j,. Also, a tensor with all its indices contracted yį1;2::: i2 is an 
SU(n) invariant scalar. 
(2) The Levi-Civita symbol is defined as the totally antisymmetric 
quantity 
l if (ii, ..., i) is an even permutation of (1,..., n) 
gfi in S ena = 9 —Lif (i, i) is an odd permutation of (1,..., n) 
0 otherwise. (4.102) 
It is an invariant tensor 
E; = UU... Uje 


EA ty 


jij2---jn 


izreke Sitrin (4.103) 
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where we have used the definition of determinant and det U = 1 for SU(n). 
Similar to the contraction by 6/ we can sum over indices by using the Levi- 
Civita symbol. For example, 


ND i es (4.104) 


Thus tensors with upper indices can be constructed from those with lower 
indices, and vice versa 


yii = (e2: impa p (e Ap). (4.105) 


So, in principle, to study the transformation properties of the tensor we need 
work with a tensor having only upper (or lower) indices. 


In this connection we also note that 
y= Epi. iQ? im (4.106) 


is an SU(n)-invariant scalar. And eqns (4.103) and (4.106) imply that a totally 
antisymmetric tensor of rank n is invariant under SU(n) transformations. 


Irreducible representations and the Young tableaux 


Generally the tensors we have just defined are bases for reducible represen- 
tations of SU(n). To decompose them into irreducible representations we use 
the following property of these tensors. The permutation of upper (or lower) 
indices commutes with the group transformations, as the latter consist of 
identical U;;s (or Ui s). We will illustrate this with the following example. 
Consider the second-rank tensor y;; whose transformation is given by 


Ww = UiUjy". (4.107) 

Since the Us are the same, we can relabel the indices 
yË = UU Ly = ULUjy". (4.108) 
Thus the permutation of the indices does not change the transformation law. 


If P,, is the permutation operator which interchanges the first two indices 
P,.W = yY”, then P,;, commutes with the group transformation 


P y = U UIP ai (4.109) 


This property can be used to decompose Ņ*# as follows. First we form 
eigenstates of the permutation operator P,, by symmetrization or 
antisymmetrization, 


y_i ij ji y_i ij _ yji 
Sia sty), Ava SW y). 


Thus, 
P, S” — SY, P, AË a — AÏ, (4.110) 


It is clear that SY and A” will not mix under the group transformation 


si=utiuis®, Al suiuiA®. (4.111) 
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This shows that the second-rank tensor Y} decomposes into S and A” in 
such a way that group transformations never mix parts with different 
symmetries. It turns out SY and A cannot be decomposed any further and 
they thus form the basics for irreducible representations of SU(n). This can 
be generalized to tensors of higher rank (hence the possibility of mixed 
symmetries) with the result that the basis for irreducible representations of 
SU(n) correspond to tensors with definite permutation symmetry among (the 
positions of) its indices. The task of finding irreducible tensors of an 
arbitrary rank f (i.e. number of upper indices) involves forming a complete 
set of permutation operations on these indices. The problem of finding the 
irreducible representation of the permutation group has a complete solution 
in terms of the Young tableaux. They are pictorial representations of the 
permutation operations of f objects as a set of f-boxes each with an index 
number in it. For example, for the second-rank tensors, the symmetrization 
of indices i and j in $; is represented by [i[j]; the antisymmetrization 


operation in A;, is represented by H. For the third-rank tensors, we have 


Ea 
|i {7 |] in the case of the completely symmetric S;,,, in the totally 
antisymmetric A;,,, and for the tensor with mixed symmetry 
k 


Vije = Yije + Win — Vici — rji 
A general Young tableau is shown in Fig. 4.8. It is an arrangement of f boxes 
in rows and columns such that the length of rows should not increase from 


top to bottom: jh 2h 2... and f +f +... =f Each box has an index 
i, = 1,2,...,n. To this tableau we associate the tensor 
Vii TEN ifyiify+is...,ipptyo,... (4.112) 


with the following properties. 


(1) Indices appearing in the same row of the tableau are first subject to 
symmetrization. 

(2) Subsequent indices appearing in the same column are subject to 
antisymmetrization. 


A tableau where the index numbers do not decrease when going from left to 
right in a row and always increase from top to bottom is a standard tableau. 


For example, the nm=3 mixed-symmetry tensor has the 
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following standard tableaux 
BERG bt 
2 3 


while tableaux such as |111], 1213], and are not standard. 


The non-standard tableaux give tensors that, by symmetrization or 
antisymmetrization, either vanish or are not independent of the standard 
tableaux. Thus for a given pattern of the Young tableaux the number of 
independent tensors is equal to the number of standard tableaux which can 
be formed. It is not hard to see that this number for the simplest case of a 
tensor with k antisymmetric indices is 


k n\ nn—1l)...a~—k +1) 
7 NK) iek (4.113) 


and that for a tensor with k symmetric indices the number is 


EERE (oiT ) mee eee. 


<a k 1.2...k 


One should note that because of antisymmetrization there are not more 
than n rows in any Young tableau. Also, if there are n rows, we can use 
E;,i,...i, tO contract the indices in the columns with n entries. Pictorially we 


can simply cross out any column with n rows (see, for example, eqns (4.123) 
and (4.125)). 


(4.114) 


Fundamental theorem (See, for example, Hammermesh 1963.) A tensor 
corresponding to the Young tableau of a given pattern forms the basis of an 
irreducible representation of SU(n). Moreover if we enumerate all possible 
Young tableaux under the restriction that there should be no more than 
n — | rows, the corresponding tensors form a complete set, in the sense that 
all finite-dimensional irreducible representations of the group are counted 
only once. 

We next give two formulae of the dimensionality of irreducible represen- 
tations. If the Young tableau is characterized by the length of its rows 
(fi, fo,---,fr-1), define the length differences of adjacent rows as 
A, =hi —h» a. =fz — f3» ---,ÅAn-1 =Sn-1- The dimension of an SU(m) 
irreducible representation will then be the number of standard tableaux for a 
given pattern 


MA, Az, +++, Anz) = A + A) +A)... + An-1) 


Ay tA A, +3 Rice F Apei 
1 Jet ae _„_ 
iat yrs (1+ i 


\ 
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x Te ES tas E EE — j q Anca + Ân- + Anni 
3 3 3 
. (+t tA), (4.115) 


One can easily check that the special results of (4.113) and (4.114) for the 
tableaux (k, 0,0,...) and (0, 0,..., 1, 0,0,...) are recovered. 
ee 


k 


Example 1. SU(2) group. The Young tableaux can have only one row: 
d(A,) = (1 +4,). Thus 2, = 2). It follows that a doublet is pictured as [_ | 
and a triplet as[ [_, etc. 


Example 2. SU(3) group. The Young tableaux can have two rows, hence 
d(A,, 42) = (1 + 4,) + 4,)(1 + (A, + 4,)/2). Thus, 4, =p and 4, =q of 
(4.89). 


CJd,0)3, C1 )(2,0)6, CLIJ (3,0) 10, 


ia (0, 1) 3*, aE (0, 2) 6*, mm (1,1) 8. (4.116) 


The formula (4.115) is rather cumbersome to use for large values of n; in such 
cases the second formulation is perhaps more useful. For this we need to 
introduce two definitions—hook length’ and ‘distance to the first box’. For 
any box in the tableau, draw two perpendicular lines, in the shape of a 
‘hook’, one going to the right and another going downward. The total 
number of boxes that this hook passes, including the originating box itself, is 
the hook length (h;) associated with the ith box. For example, 


ee h; = 3, H h; = l. (4.117) 
A ' 


The distance to the first box (D;) is defined to be the number of steps going 
from the box in the upper left-hand corner of the tableau (the first box) to the 
ith box with each step towards the right counted as +1 unit and each 
downward step as — 1 unit. For example, we have 


KITEA 
zijo (4.118) 


The dimension of the SU(n)-irreducible representation associated with the 
Young tableau is given by 


d = I] (n + D,)/h; (4.119) 


The products are taken over all boxes in the tableau. For example, for the 
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tableau pattern HK , we have hook lengths TG and distances to the first 
1 


box H This yields the dimension d = n(n — 1)(n + 1)/3, which gives 
—Í }. 


d= 8 for n = 3. 


Reduction of the product representations 


One of the most useful applications of the association of SU(n)-irreducible 
representations with the Young tableaux is the decomposition of the product 
representations. To find the irreducible representations in the product of two 
factors, 


(1) In the tableau for the first factor, assign the same symbol, say a, to all 
boxes in the first row, the same b to all the boxes in the second row, etc. 


(4.120) 


(2) Attach boxes labelled by the symbol a to the tableau of the second 
factor in all possible ways, subject to the rules that no two a’s appear in the 
same column and that the resultant graph is still a Young tableau (i.e. the 
length of rows does not increase going from top to bottom and there are not 
more than n rows, etc.). Repeat this process with the bs, ... etc. 

(3) After all symbols have been added to the tableau, these added symbols 
are then read from right to left in the first row, then the second row ..., and 
so forth. This sequence of symbols aabbac... must form a lattice permu- 
tation. Thus, to the left of any symbol there are no fewer a than b and no 
fewer b than c, etc. 


We consider two examples in the SU(3) group. 


Example 1. 
ERE Wma (4.121) 


which corresponds to 
3x3=3* +6. (4.122) 


ae oe 
b 
First step: eee ee a ae 
i lal | 
a| 


Example 2. 
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Second step: | ] Jafa) + + +[T]. 
a a 


(4.123) 


As we have already explained, any column with n boxes in an SU(n) tableau 
can be ‘crossed out’—indicated by a vertical line over the column—thus the 
last three tableaux yield two octets and one singlet. 


Note that tableaux such as[ J Jaļa[b]anda| | Jab] are rejected 
E lal 


because the symbols do not form a lattice permutation. Thus 
8 x 8=14+8+484 10+ 10* +27. (4.124) 


Young tableaux for conjugate representations. If y; and yŻ are the bases for 
the defining representation n and its complex conjugate n*. Clearly w,' is 
SU(n) invariant. It is not difficult to see from the reduction of product 
representation that the Young tableau for the conjugate representation is a 
column of n—1 boxes so that there will be an identity representation 
(a column of n boxes) in the product n x n*. 


as = a 
n=l n mada (4.125) 


In general if we take the Young tableau for a representation wii: and fill the 
boxes such that rectangular tableau of n rows is obtained, the additional 
boxes, when rotated by 180°, form the standard tableau for the complex 
conjugate representation (Y}:::)* = y$:::. For example, in SU(3) we have 6 


as pictured by [|_| which can be filled in as [_[_]. Thus H is the 


Young tableau for 6*. HH 


Group generators in tensor notation 


We first concentrate on the (defining) vector representation and later 
generalize our study to the action of generators on higher-rank tensors. 


(1) Hermitian and real generator matrices. Any n x n unitary unimodular 
matrix U may be written in the form 


U=” (4.126) 
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where H is hermitian and traceless. Normally we choose the group 
parameters to be real. If s, = e*, a = 1, 2, ..., n? — 1, then 


H = £ha (4.127) 


where the 1,8 are n x n hermitian generator matrices. Those for SU(3) have 
been displayed in (4.74). From their commutation relation 


A] a ane 


we can extract the Lie algebra with the identification of 44, = F, as the 
generators 
[Fas Fy) = Yack (4.129) 


where the fibs are the structure constants. For the tensor-method approach 
instead of the form (4.127) we can write the hermitian matrix H as 


Hi = e( W£) (4.130) 
where all indices (a, p, i, j) range from 1 to n. We can choose to have real 
generator matrices which take the form 


(WEY = òu 6° — + 58 6. (4.131) 


The hermiticity condition on H is then satisfied by having the hermitian 
group parameter matrix 


6, See 
Using (4.131) we can work out the commutator 
CWE, W°] = bw? — dew. (4.132) 
The group generators are defined to satisfy the same commutation relation 
(the Lie algebra) 
LFS, Fê] = 68°F! — $F”. 


The structure constants are simply some combinations of the 64s. It is not 
difficult to find the relation between the real generators FÊ and the hermitian 
generators F", 


Fe = Agere - (4.133) 
Eqn (4.133) can be inverted by using the identity, 
n2-—1 1 
5 (A*), (47) 95 = 1 Op, = j Oup i. (4.134) 
a=1 
which can be derived as follows. Since the n x n hermitian traceless matrices 


A", a=1,...,(n? — 1), together with the n x n identity matrix form a 
complete set of n x n hermitian matrices, we can expand an arbitrary 
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hermitian matrix M in terms of them 


n2—1 
M=m1,+ > m,A* (4.135) 
a=1 
where 1, is the n x n identity matrix. We normalize the A*s such that 
tr(A7A") = 26”. 


The coefficients mọ and the m,s in eqn (4.135) can be calculated by 
multiplying matrices of the corresponding bases and taking traces, 


Eqn (4.135) then becomes 


l nE] 
M = : (tr M+ > s (tr MA*)A2 
a=1 
or, in terms of the components, 


1 1 
Mag ain È, M5; O46 Sap sae ` > (Aal hM 5, - 
n y, 6 2 a y,6 


Since M,, is arbitrary, we get 


l l 
Ôn Ôp, S n Oup O65 T 2 X (Aag lA) o 


which is just the identity (4.134). Using eqn (4.134) we can write the real 
generator as 


Fa = LPO): 


where we have used the fact that tr F° = 0. 


(2) Real generators in vector representation. The nondiagonal real gener- 
ators FÊ, a + B, are simply raising and lowering operators. For the defining 
vector representation, W£ has a nonzero element only at the ath row and fth 
column. The infinitesimal SU(m”) transformation on the basis 
Wi > y" =i + ôy’ with ôy’ = e(W2)iW’ shows that 


(Few) = dw’. (4.136) 


Thus, FË will take the ath component of Ww and turn it into the fth 
component, and the result will be zero for all other components. The 
diagonal generators Fžs form a set of mutually commuting operators. Their 
eigenvalues can be used to characterize the basis functions (states) of 
irreducible representations. For example, in the defining vector represen- 
tation, any particular diagonal generator D,, which is some linear combi- 
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nation of the W%s, may be written 
(D$ = (di); ô} (4.137) 
where the (d,);s are the eigenvalues of state y’. 


(3) Real generators on higher rank tensors. The group generator FË acting 
on any tensor is defined as 


ôy = Y’ — y = ict Fp). (4.138) 


Given (4.99), the general transformation law of w with the SU (n) transfor- 
mation factor given by 


UF = ôf + iez(W®) (4.139a) 
and 


where the i, j (k, D indices belong to untransformed (transformed) tensors, 
we have 


P 


Fpi = > (H A ala E A 
m=1 


q . . . 
— X (Wey, (4.140) 
n=1 


The presence of the minus sign reflects the fact that the tensors with upper 
indices correspond to complex conjugate representations, as compared to 
those with lower indices (see eqns (4.94) and (4.95)). In particular for the 
diagonal generators F = D, (4.137) we have 


p q os 
Dwteky=| È (di), — 2 ay, ee (4.141) 
m=1 n=1 


Thus the quantum number of the tensor is simply the algebraic sum of the 
corresponding quantum numbers of the component vectors which make up 
the tensor. 


We will summarize this discussion by working through the simple example 
of the j = 1 representation of SU(2). Instead of using the m = 1,0, — 1 states 
as in (4.51) and as its hermitian generator the 3 x 3 matrices in (4.52) and 
(4.55), in the tensor method approach the bases are taken to be w;; ~ wiw re 
The indices i, j = 1, 2 are symmetrized (see (4.68)). The superscripts A and B 
distinguish the two vectors. The real generators for the SU(2) defining 
representation are 2 x 2 matrices: 


1/2 0 
wi = —-wi-(" 
0 -1/2 


is the diagonal generator, giving eigenvalues d(W,) = 1/2 and d(Y) = —1/2 
for the two states in Y;. We can read off the quantum numbers for the triplet 
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states as 
AwWi)=7+7=1, d(Wy2) =3-4 =0, 
A(Wo2) = —3 -4 = —1. (4.142) 


is the lowering operator. Thus, 


Wh, = ( d Wi? 4 ( ) Wid? = p4p? + Wed. (4.143) 
1 0/7, 1 O/7, 

One last comment—the adjoint representation which we have defined in 
terms of the structure constants of a group (4.19) takes on a particularly 
simple form in tensor notation. The basis for adjoint representation is simply 
yi with wi = 0, where its Young tableau is the last one in (4.125) and 
evidently is self-conjugate. One can show that this is the correct identification 
by converting w using a method similar to that in eqn (4.133) to an n? — 1 
component vector h° using 


p° = 2HY. (4.144) 


The transformation law (4.140) can be used to demonstrate that the 
matrices for its generators (F%),. are indeed the structure constants f,,.. 


4.4 The quark model 


Group theory is relevant in physics because the various symmetry 
transformations which leave the physical system invariant form a group. The 
consequence of symmetry can then be deduced through group-theoretical 
analysis, independent of any detailed dynamical considerations. For ex- 
ample, if a quaantum-mechanical system, described by the Hamiltonian H(r), 
has no preferred direction, all rotation operators R(0) will leave the 
Hamiltonian invariant, 


ROHR- !(0) = H(r). (4.145) 
Or, in terms of the generators of the rotation, R(0) = e® `, this gives 
[H, J] = 0. (4.146) 
The consequence of this symmetry, 1.e. (4.145), is that 
A(J;|n>) = E,(Ji|n>) (4.147) 
i Hin) = Eln). (4.148) 


Thus, all states connected by a rotation transformation are degenerate. These 
states form the basis vectors for irreducible representations (j) of the group. 
From the result on the dimensionality of the irreducible representations 
in SO(3) we conclude that there is a (2j + 1) degeneracy of energy levels. 
In internal symmetries the states are identified with various particles. Such 
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symmetry transformations change particle labels but not the coordinate 
system, and irreducible representations of the group manifest themselves as 
degenerate particle multiplets. In this section we shall present a brief dis- 
cussion of the early successful internal symmetries of the strong interaction. 
First we have the isotopic spin, or isospin, invariance of SU(2). It is later 
found to be part of a larger group SU(3). This is the Eightfold Way theory of 
Gell-Mann (1961) and Ne’eman (1961). This in turn led to the proposal by 
Gell-Mann (1964b) and by Zweig (1964a) that quarks are constituents of 
hadrons. Our purpose here is to give an informal historical introduction and 
to establish, so to speak, the kinematics of the quark model—all in 
preparation for the study of the dynamics, with quarks being the funda- 
mental matter field. 


Isospin invariance—SU(2) symmetry 


In the early studies of nuclear reactions it was found that, to a good 
approximation, the nuclear forces (strong interactions) are independent of 
the electric charge carried by nucleons. The strong interactions are invariant 
under a transformation which interchanges proton (p) and neutron (n). 
More precisely the strong interaction has an SU(2) isospin symmetry in 
which the p and n states form an isospin doublet. Thus the group structure of 
isospin symmetry is very similar to that of the usual spin. The isospin 
generators 7; satisfy the Lie algebra of SU(2) 


LT; T; | = il&ijk Ik (4.149) 


where the indices range from 1 to 3. That p and n form a doublet È means 
n 
that (see eqn (4.45)) 


Tlp) = 3IP>. T3|n> = —żln> 
and 
T,|n>= (p>,  T_|p> = In). (4.150) 


That the strong interaction does not distinguish n from p means that the 
strong-interaction Hamiltonian H, has the property 

[7;, H,| = 0 a 2a (4.151) 
The concept of isospin can be extended to other hadrons. For example, 
(x*,n°, n7), (2°, 2°, E7), and (p*,p°,p ) are (T= 1) isotriplets; 
(K+, K°), (K°, K7), and (=°, 27) are (T = 1/2) doublets; n, œ, p, and A are 
(T = 0) isosinglets. 

Since different members of the isospin multiplet have different electric 
charges, the electromagnetic interaction clearly does not respect the isospin 
symmetry. Thus isospin cannot be an exact symmetry. How good a 
symmetry is it? If the symmetry is an exact one, we have 


[T,, H] =0 (4.152) 


for the total Hamiltonian of the system; all members of an isomultiplet 
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would be strictly degenerate in mass. Thus the mass differences within an 
isomultiplet are a good measure of the symmetry breaking. Experimentally 
they are typically at most a few per cent of the masses themselves, e.g. 

m, —m 


P ~ 0.7 x 107? 
m, + m, M,+ + M,0 


Mat T MR S 1.7 x 1072 etc. (4.153) 


We conclude that isospin is a rather good symmetry and write the total 
Hamiltonian as 


H = H + Ħ, 
where 
[H,, T;] = 9, [H,,7,] 40 (4.154) 
with 
Hy > H. (4.155) 


Thus we can treat the symmetry-breaking part (H,) as a small perturbation. 
As we have mentioned, the electromagnetic interaction must belong to H,. It 
turns out that weak interactions also violate isospin symmetry. One 
interesting question is whether the strong interaction contains a part that 
does not respect isospin invariance. We shall return to this question of a 
possible small isospin violation by the strong interaction in §5.5. 


SU(3) symmetry and the quark model 


When the A and K particles were discovered they were found to be produced 
copiously but to decay with a long lifetime. It was postulated that these ‘new’ 
particles possessed a new additive quantum number, strangeness S, which is 
conserved in the strong interaction (associated production) but is violated in 
the decay of these particles via the weak interaction. For example, the pions 
and nucleons have zero strangeness but S(A°) = —1, S(K°) = +1, so that 
we have the strangeness-conserving strong production nm + p > A? + K® 
which is followed by the strangeness-changing weak decays, A? > nm +p 
and K? +x* +27. The strangeness S, like the electric charge Q, is 
associated with a U(1) symmetry. In fact it was noted that there is a linear 
relation, the Gell-Mann—Nishijima relation, among S, Q, and the diagonal 
generator T of the isospin SU(2) (Gell-Mann 1953; Nishijima and Nakano 
1953), 


Y 
Q=T3+ J (4.156) 
with 
Y=8+S 


where B is the baryon number and Y is called the hypercharge. Thus isospin 
and strangeness (or hypercharge) are only approximately conserved, but a 
certain linear combination, the electric charge, is preserved by all known 
interactions. 

The search continued for ‘higher symmetry’ that could incorporate isospin 
T, and hypercharge Y together in one group by enlarging the multiplet, i.e. to 
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find a larger simple group which contains SU(2); x U(1)y as a subgroup. 
Gell-Mann (1961) and Ne’eman (1961) pointed out that we could group all 
mesons or baryons with the same spin and parity on the (T3, Y) plot (Fig. 
4.9), and they look very much like representations of the SU(3) group (Fig. 
4.6(c), (d)). One sees that 07, 17 mesons and 1/2* baryons fit nicely into the 
octet representation (p = q = 1) while 3/2* baryons fit the decuplet represen- 
tation (p = 3, q = 0). The octet particles y} being tensors of only two indices 
can be written in matrices 


aaa on + 
2 6 T K 
= 2.30 0 
M= - T yil Ko 
J2 J6 
R= go 
J6 
(0) œ 
atp * OK 
Dp? o? 
j= pr oe ee K*0 (4.157) 
7 — 20° 
K*- po 
J6 
5 AS 
foal E rt 
2" Yo 
=Y A? 
B = È- — + —— 
72 + J6 n 
mia =0 —2A° 
= E 76 


Of course at the time of the Eightfold-Way proposal not all the mesons and 
baryons predicted by this pattern were well established. The discovery of 
Q- (Barnes et al. 1964) at the predicted mass value and with the correct 
decay properties (Gell-Mann 19625) played an important role in convincing 
a large segment of the physics community as to the correctness of this 
SU(3) classification scheme. Clearly this SU(3) is not as good a symmetry 
as the isospin SU(2). A measure of the SU(3) breaking is the mass splitting 
within the multiplet, e.g. (ms — my)/(ms + my) x 0.12. 

One notable feature of the hadron spectrum in the Eightfold-Way scheme 
is that the fundamental (or defining) representation of SU(3) (Fig. 4.6(a), (b)) 
is not identified with any known particles. The significance of the funda- 
mental representation in any SU(m”) group is that all higher-dimensional 
representations can be built out of the tensor products of the fundamental 
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representation. This property is particularly transparent in the tensor- 
method approach to SU(n) (see §4.3). On the other hand, more and more 
strongly interacting ‘elementary particles’ had been discovered. It is difficult 
to believe that all these hadrons are truly elementary and devoid of structure. 


Fic. 4.9. Hadrons in SU(3) representations. Octets for (a) 07 mesons; (b) 17 mesons; (c) 4+ 
baryons; and decuplet for (d) 3/2* baryons. 


Against this background, the quark model was proposed, in which all 
hadrons are built out of spin-1/2 quarks which transform as members of 
the fundamental representation (p = 1, g = 0) of SU(3). (Clearly even if one 
does not believe in the physical reality of quarks, they are a useful mnemonic 
device for the less familiar group of SU(3).) 


(1) There are three types (flavours) of quarks, ‘up’, ‘down’, and ‘strange’, 
in the fundamental representation, 3 


qi u 
q= qa.} =] d (4.158) 
q3 S 


corresponding to a Young tableau of [ |. The members have quantum 
numbers 


S 


2/3 1/2 1⁄2 13 0 1/3 
—1/3 172 -12 13 0 1⁄3 
as © © = E I2 


Qu. 


Nn 
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Their antiparticles, called antiquarks, are in the conjugate representation, 3* 


q a 
g =| q }={ d (4.159) 
q° 5 


corresponding to the Young tableau - Their additive quantum numbers 


are just the negative of those for the quarks 


QO T T, Y S B 

u 23 12 212 -=1: 0 1/3 

d 13 12 12 =13 0 —1/3 
1 


5 13 0 0 2/3 —1/3 


(2) The mesons (B = 0) are gq bound states. From 3 x 3* = 1 + 8 we 
have mesons in SU(3) singlets and octets. For the 07 mesons, we have 


n> ~ du, n? ~ (ŭu — dd)/ V2, n` ~ id, 
K*+~su, K®°~8d, K? ~ds, K` ~iss, 
n° ~ (au + dd — 28s)//6. (4.160) 


Similarly octet 1~ vector mesons have the same quark contents. The 07 
meson 17’ and the 1~ meson ‘p’ (more on this later) can be identified with the 
SU(3) singlet q'q; = (Gu + dd + 5s). 


(3) The baryons (B = 1) are qqq bound states. From the multiplications 


L)*Ci=- +C ie.3 x 3=3* + 6 


0A -+p ie. 3 x 3*=14+8 


= Sa Tan i.e. 3 x 6 = 8 + 10, 


we have 
3x3x3=1+8+8+10. (4.161) 


The octet parts have the same quantum numbers (73, Y) as the octet mesons, 
even though they have different quark contents, because T, and Y (also total 
isospin 7) are generators of the SU(3) group and their eigenvalues for a given 
representation are uniquely defined. Meson octet states and baryon octet 
states will have a different baryon number; B is not a generator of SU(3). 


Specially, for the 1/2* baryons + 


p~ udu, n~ udd, 
E+ ~ suu, X°? ~ s(ud + du)/ V2, E` ~ sdd, 
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2° ~ ssu, =” ~ ssd, 


A° ~ s(ud — du)/ vZ (4.162) 
and for the 3/2* baryons [| T [_| 
N***+ ~ uuu, N** ~ uud, N*° ~ udd, N*~ ~ ddd 
E*t ~ suu, E*° ~ sud, L*~ ~ sdd, 
E*!~ ssu, E*” ~ ssd, 


QT ~ sss. (4.163) 


The Gell-Mann—Okubo mass formula 


Here we study the hadron spectrum in the presence of the SU(3)-breaking 
H, « Ho similar to (4.154). The isospin, as mentioned above, seems to be a 
good symmetry, hence the mass difference in the isospin multiplet can be 
neglected in discussing the SU(3) breaking. We can proceed in a pure group- 
theoretical manner. With an assumption about the SU(3) transformation 
property of H, (~Y, or equivalently ~F,), the relation among masses of 
isospin multiplets in a given SU(3) representation can be derived (Gell- 
Mann 1961; Okubo 1962). Here we will demonstrate this with a simple 
calculation in the quark model: we assume that the binding energies of 
quarks are independent of quark flavours (this can be justified later) and 
that the mass differences in an SU(3) representation are entirely due to the 
quark mass difference. This is a specific realization of the H, ~ Fg assump- 
tion. In the approximation of exact SU(2) isospin symmetry, we have 
m, = ma. First consider the 07 meson masses. In terms of the quark masses we 
have from (4.160) 


m2 = Mo + 2m, 
mẹ = mo + m, + m, 
m? = mo + 3(m, + 2m,) (4.164) 


where mọ is the flavour-independent common mass. We have used the 
quadratic mass for mesons. The principal reason is that this works better 
(than linear masses). A possible justification is that 07 meson masses vanish 
in the SU(3) symmetry limit (see Chapter 5) and perturbation of the energy 
around such a value automatically leads to a relation among quadratic 
masses. From (4.164) we obtain 


4m? = m2 + 3mž. (4.165) 


Experimentally the left-hand side ~ 0.98 GeV?, and the right-hand 
side ~ 0.92 GeV”. Thus this mass relation is good to a few per cent. Similarly 
for the 1/2” baryons from (4.162), 


My = Mo + 3m, 


My = Mo + 2m, + mM, 
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mz = Mo + m, + 2m, 

m, = Mo + 2m, + M,. 
Eliminating the three parameters mo, m,, and m,, we have one relation 
among the baryon masses 

Ms + 3m, 

2 

which is experimentally very well satisfied. We have the left-hand 
side ~ 2.23 GeV and the right-hand side ~ 2.25 GeV. The quark model 


allows us to identify the particular mass shifts from the SU(3) average value 
my. This yields an additional relation 


m, = mz. (4.167) 


For the 3/2* baryon decuplet (4.163) we easily derive the equal-spacing rule 
Mo — Max = Mas — My» = Myx — Myx. (4.168) 


In fact the mass of Q` was first (correctly) predicted from this rule. 


@—@ mixings 


The Gell-Mann—Okubo mass formula for the 1~ vector meson multiplet does 
not seem to work. In analogy to the 0” mesons (4.165), we would have 


3m2 = 4m2. — m?. (4.169) 


p 


With my» = 890 MeV and m, = 770 MeV, this equation would predict 
m, = 926.5 MeV, while the experimental value is m,, = 783 MeV. It turns out 
that there is another 1~ vector meson 6 with the same quantum numbers as 
o (i.e. T = 0 and S = 0) and it has a mass of 1020 MeV. This leads to the idea 
that œ is not a pure SU(3) octet state but has a mixture of an SU(3) singlet 
state (Sakurai 1962). Let V, be the T = Y = 0 member of the SU(3) octet and 
V, be the SU(3) singlet state, then œ is a linear combination of V and V, 
while @ is the other orthogonal combination. More precisely, write the mass 
matrix in V and V, space as 


meg me 
M = ( a (4.170) 
The wavefunctions for Vg and V, are 
Vs = (ŭu + dd — 25s)/ V6 
V, = (uu + dd + 5s)/ V3. (4.171) 


The eigenvalues of M should be m, and m, 


2 
rur=(™ © (4.172) 
0 m 


R=( cos 0 ae (4.173) 


—sin@ cos@ 


where 
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Hence 
œ = cos 0 V; — sin 0 V; 


o = sin Vz + cos 0 V,. (4.174) 


Since (4.169) predicts that mgg is 926.5 MeV, we can calculate the mixing 
angle. From (4.172), 


2 


M = Rè" zje (4.175) 
0 My 


we have 
sin 6 = [(mgg — m3 )/(m3 — m2)]"!? 
= 0.76. (4.176) 


The fact that sin@ is very close to ./(2/3)=0.81 has the following 
significance. If sin 0 is exactly ./(2/3), called ‘ideal mixing’, we have 


© = (ŭu + dd)/./2 (4.177) 
b = 5s. (4.178) 


Namely, for ideal mixing, ọ is completely built out of the strange quarks and 
œ out of the non-strange quarks. Thus with the value of 0 in (4.176) it is clear 
that o is predominantly 5s and has very few ūu and dd components. 


The Zweig rule and the discovery of ‘charm’ 


As œ and ọ have the same quantum numbers T = 0, Y = 0, one would expect 
that they should have very similar strong-interaction properties. In particular 
their strong decay widths should be comparable. Experimentally this is not 
so. œ decays predominantly, as it should, into the 3x channel, while @ > 311s 
suppressed relative to ¢ — KK even though the phase space for the latter 
decay is very small (m, is barely above 2m, ~ 998 MeV). This indicates a 
strong preference for @ to decay into channels involving strange particles 
rather than into channels without strange particles. To explain this, Zweig 
(1964b) and also others independently (Okubo 1963; Iizuka 1966) suggested 
that strong processes in which the final states can only be reached through q 
and g annihilations are suppressed. Thus, since ọ is a predominantly 5s state, 
the decay into pions must proceed through the annihilation diagram of Fig. 
4.10(a) while the decay into the KK channel involves no annihilation of s 
and 5, as shown in Fig. 4.10(b). 


4 
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Fic. 4.10. ọ decays: (a) disallowed; (b) allowed by the Zweig rule. 
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In 1974 the W/J(3100) particle was dicovered (Aubert et al. 1974; Augustin 
et al. 1974). It has the rather unusual property of having a width (only about 
70 KeV) much narrower than the widths of typical hadrons (e.g. 
I, ~ 150 MeV, I’, ~ 10 MeV). Also width should increase with mass as 
there will be more decay phase space. The interpretation is that W/J is a 
bound state of a new heavy quark, the charmed quark c and its antiparticle C, 
i.e. W~ Cc. It is below the threshold for Zweig-rule allowed decays into 
charmed mesons (bound states involving at least one charmed quark), see 


Fig. 4.11. 
0 zD 
A _Yf 
a q T 4 
mt d 


cf pt 


(a) (b) 
Fic. 4.11. Zweig-rule allowed decays such as (b) y > D*D- are forbidden by phase space. 


For a detailed discussion on charmed particles see Gaillard, Lee, and 
Rosner (1975). Here we only note that this new 2/3-charged quark was 
predicted earlier on the basis of lepton-quark symmetry (Bjorken and 
Glashow 1964) and, more compellingly, on the basis of the requirement to 
suppress strangeness-changing neutral-current effects (Glashow, Iliopoulos 
and Maiani 1970). With this new quantum number the flavour symmetry is 
enlarged from SU(3) to SU(4). Of course SU(4) is badly broken as m, is heavy 
(~1.5 GeV). Consideration of such badly broken symmetry is no longer 
particularly meaningful. One should go directly to the dynamical considera- 
tion of quark models of such hadrons carrying new quantum numbers. For 
example, we have the additional 0° meson states 


D+ ~dce, D7 ~ēd, D®~iic, D° ~u, 
Ft ~se, Fo ~G, Ne~ če. (4.179) 


In fact one of the most convincing bits of evidence for the quark model is the 
detailed verification of level structure and transitions among the various (Cc) 
‘quarkonium’ states. 

In 1977 yet another set of narrow-resonance Ys were discovered (Herb et 
al. 1977; Lederman 1978) and they were successfully interpreted as bound 
states of yet another heavy quark, b (for ‘beauty’ or ‘bottom’) carrying 
charge — 1/3 with a very large mass m, ~ 5 GeV. 

As we shall see (especially §11.3), from the pattern of fermion family 
replication in the standard electroweak theory one anticipates at least one 
more superheavy flavour of quark: this quark, t (for ‘truth’ or ‘top’), should 
carry charge 2/3. It is to be associated with the b-quark in the same way as 
pairing of (u, d) and (c, s). 
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The pioneering paper on heavy quarkonium is Appelquist and Politzer 
(1975). (For more recent discussions, including (bb) quarkonium, and 
reviews see, for example, Appelquist, Barnett, and Lane 1978; Quigg and 
Rosner 1979; Eichten et al. 1980; Shifman 1981.) 


Paradoxes of the simple quark model 


By simple quark model we mean the model of three, or more, types of quarks 
as originally invented with no hidden degrees of freedom. This simple model 
has the following difficulties. 


(1) The quarks have fractional electric charges while all the observed 
hadrons have integer charges. With charge conservation, this implies that at 
least one of the quarks is absolutely stable. The fractionally charged stable 
quark has been searched for and so far there is no generally accepted positive 
evidence for its detection (see however LaRue, Fairbank and Hebard 1977). 

(2) Hadrons are seen to be built exclusively out of qq and qqq states (and 
their conjugates). There is no evidence for qq and qqqq bound states. It is 
difficult to understand the absence of such hadron states with masses 
comparable to the observed particles. 

(3) The most serious problem is that the J? = 3/2* decuplet baryon 
wavefunctions seem to violate the connection between spin and statistics. 
Take the example of N*** ~ uuu. Since it is a ground state for the system of 
three u-quarks, the spatial wavefunction has zero total angular momentum 
and is totally symmetric. But N** * has spin-3/2 and the spins of all u quarks 
must be lined up in the same direction for the N** * wavefunctions (with the 
third component of spin s = + 3/2) so the spin wavefunction is also totally 
symmetric. Consequently the overall wavefunction is totally symmetric with 
respect to the interchange of any pairs of constituent quarks. This violates 
Fermi—Dirac statistics since the u-quark is a spin-1/2 fermion. 


Colour degree of freedom 


The way out of all these difficulties graduately emerges (Greenberg 1964; 
Han and Nambu 1965; Nambu 1966). It is to postulate that each quark has a 
hidden degree of freedom, called colour. More specifically each type of quark 
is assumed to come in three different colours which form a triplet under a 
colour SU(3) group. Thus for the known quarks we have 


U, = (U1, U2, U3) 
d, = (d,, d,, d3) 


flavour Ss, = (S4, S2, S3) 
C, = (Cy, C3, C3) 
b, = (b, ’ bp, b3) (4.180) 


«——— colour ————> 


The five types (flavours) of quarks correspond to five distinct colour triplets 
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The colour group operators change the quark from one colour to another but 
leave the flavour unchanged: u,u,, u, u3, U;uU,, or d; ed,, 
d, ed}, d,<+d,, etc. Along with this supposition of an extra degree of 
freedom, it is further postulated that only colour singlet states are physically 
observable states. 

Since there are colour singlets in the product of 3 x 3* and 3 x 3 x 3, see 
(4.161), only gq and qqq configurations can bind into physically observable 
hadrons while q, qq, or qqqq states cannot be seen experimentally. The 
N*** wavefunction is now antisymmetric 


N*** ~ u,(x,)ug(x2)u,(x3)e"*”. (4.181) 


where a, B, y are colour indices 1, 2, 3. 

It cannot be overemphasized that this colour SU(3) has nothing to do with 
the original flavour SU(3) of the Eightfold Way. In fact, unlike all the flavour 
SU(m) symmetries with n = 2,3,4,..., the colour SU(3) symmetry is 
assumed to be exact. 

We now have a very peculiar situation where hadrons are composed of 
particles which cannot themselves be directly observed. Quarks can exist only 
inside hadrons and can never be free. This property is usually referred to as 
quark confinement. It should be a part of any viable theory of hadrons as 
quark bound states. 

In recent years physicists have converged on a gauge theory called 
quantum chromodynamics (QCD) in which the colour quantum number 
plays a similar role to that of the electric charge in QED. In QCD the 
coloured quarks will interact with each other through the exchange of the 
gluons in a manner analogous to the exchange of the photon between 
charged particles. These interactions are responsible for the (colour-depen- 
dent and flavour-independent) binding of quarks into hadrons. Even though 
QCD has many attractive features (see Chapter 10), quark confinement has 
not up to now been derived from QCD in a convincing way. But there are 
many arguments (see for example §10.5), which indicate that it should be a 
property of QCD. 


Chiral symmetry of the strong 
interaction 


In §4.4 we studied the flavour symmetry of the strong interaction and its 
physical realization in terms of the quark model. The symmetries SU(2), 
SU(3), etc. are supposed to be manifestations of the quark mass degeneracies 
M, = mų and, to a less good approximation, m, = mg = m,. As it turns out, 
the reason we have such close equalities is not so much that the three quarks 
happen to have equal masses but that they all are light on the typical strong- 
interaction energy scale. Thus the symmetry limits should really be 
M, = m4 = 0, and also to a lesser extent m, = 0, with the corresponding 
flavour symmetries being SU(2), x SU(2)p and SU(3), x SU(3)g, the chiral 
symmetries. However, we do not see any particle degeneracy patterns 
ascribable to such symmetries. The resolution of this paradox lies in the fact 
that the physical vacuum is not invariant (not a singlet) under these chiral 
symmetries and, we say, the symmetry is spontaneously broken. The physical 
manifestation of such symmetry-realization is the presence of a set of near- 
massless bosons: the three pions, and also the whole octet of 07 mesons. 
Here we present the basics of an approach commonly referred to as current 
algebra. The matrix elements of these light pseudoscalar mesons in certain 
kinematic limits are calculated by a direct application of the commutation 
and conservation relations of the chiral symmetry currents. 

This chapter is organized as follows. In §5.1 we discuss the relation 
between symmetry and the conservation laws in field theory, and also 
establish the important result that charge commutation relations are valid 
even in the presence of symmetry-breaking terms. In §5.2 we emphasize the 
point that, in field theory, symmetry currents are actually the physical 
(electromagnetic and weak) currents. Adler’s test of the current algebra (of 
chiral symmetry) in high-energy neutrino scattering is presented. In §5.3 we 
study spontaneous breakdown of global symmetries and the Goldstone 
theorem. This introduces us to the subject of partially conserved axial-vector 
current (PCAC) in §5.4. In the chiral symmetric limit of massless pions we 
can use PCAC and current algebra to derive a number of low-energy 
theorems: the Goldberger—Treiman relation, Adler’s consistency conditions 
on the nN scattering amplitude, the Adler-Weisberger sum rules, etc. In §5.5 
we study the pattern of (explicit) chiral symmetry breaking as revealed by the 
pseudoscalar meson masses and the nN o-term. 

The discussion of this chapter will show that the hadronic interactions 
obey an approximate chiral symmetry, which is realized in the Goldstone 
mode. Thus any satisfactory theory of the strong interaction must have 
these flavour-symmetry properties (also confer comments at the end of 
§5.5). As we shall see, the gauge theory QCD naturally displays such global 
symmetries. 
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5.1 Global symmetries in field theory and current commutators 


Conservation laws in physics can be attributed to symmetry principles. The 
invariance of the physical system under certain symmetry transformations 
implies an appropriate set of conservation laws. In classical physics we have 
the familiar examples 


Translational invariance in time Energy conservation 
dE 
dt 

Translational invariance in space Momentum conservation 
dp’ 

rj => Fi -+ b; <) dt = 0 

Rotational invariance Angular momentum conservation 

dJ 


In quantum mechanics, observables are associated with operators. Their 
time evolution in the Heisenberg picture is governed by the commutator with 
the Hamiltonian 


dO 

— = İ[A, 0]. 5.1 

T = 4. 0] (5.1) 
The conservation law is then equivalent to the statement that the correspond- 
ing operator commutes with the Hamiltonian. For example, the angular- 
momentum conservation dJ,/dt = 0 means that 


LJ, H] = 0. (5.2) 


It follows that the energy levels of the system have a (2j + 1)-fold degeneracy, 
j being the angular momentum eigenvalue. In group theory language the 
Hamiltonian operator is invariant under the rotation group O(3), which has 
the generators J,, J,, and J; satisfying the commutation relation 


[Ji Jj) = 18i jd i,j,k = 1, 2, 3. (5.3) 


The states with definite energy eigenvalues then form representations of the 
group O(3). The degeneracy of the energy levels is associated with the 
dimensionalities of the irreducible representations. 


Noether’s theorem 


In field theory, symmetries and conservation laws are related in a similar 
manner. This connection is made precise by the Noether theorem (Noether 
1918). For a system described by the Lagrangian 


L= [a (P(x), 0,;()) (5.4) 
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with the equation of motion 
ôL ôL 
"5G,0) 56; 
any continuous symmetry transformation which leaves the action S = f L dt 
invariant implies the existence of a conserved current 


-0 (5.5) 


O*J (x) = 0 (5.6) 
with the charge defined by 
O(t) = ee (5.7) 
which is a constant of motion 
d _ (5.8) 
dt l 


because the surface term at infinity being negligibly small 


| d3x oJ = | d3xd"J, = 0. (5.9) 


Noether’s theorem can be illustrated easily in the case of internal symmetry. 
The Lagrangian density # is invariant under some symmetry group G, i.e. 
under the infinitesimal transformation 


P(x) > OX) = d(x) + 0O;,(x) (5.10) 
6o;(x) = 1e“ti.@ (x) 


g's are (x-independent) small parameters and the fs are a set of matrices 
satisfying the Lie algebra of the group G 


cr2, t7) = ic r° (5.11) 


where 


where the Cs are the structure constants of the group G. We have the 
corresponding change in the Lagrangian density 


a Of 

ô Ta 50,6) ° ð 5.12 
Using (5.5) and the fact that 

0(0"9;) = Ou; =, On? = 0,(0g;), (5.13) 
we can write 6¥ as 

ÔL 
ôL =ð 
= ora h * To WOH? 


ÒL 
das i | 


ÒL 
— pl nT ee Un 5.14 
E a Sap iti | (5.14) 
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Clearly if the Lagrangian is invariant under the transformation (5.10), i.e. 
ôL = 0, eqn (5.14) implies a conserved current 


XJ? = 0 
with 
L 
Die E EE 
"=l Sap) 


The conserved charges given by 


$. (5.15) 


Ọ° = [as (5.16) 


are the generators of the symmetry group. 

These types of symmetries which are characterized by the space-time- 
independent parameters ° in (5.10) are called global symmetries. The fields 
@;(x) are transformed in exactly the same way for all space-time points x. 


Example 1. Abelian U(1) symmetry. The Lagrange density given by 
L = Ap) + (Oub2)7] — lI + $2) — Aloi + $3)? (5.17) 
is invariant under the transformation 
Pı > $1 = $, cosa — Q sin g 
$ > $ = Qı sina + d, cos a. (5.18) 


It is the O(2) symmetry corresponding to the invariance under rotations in 
the (¢,, @2) plane. For infinitesimal transformation, « « 1, 


pi = $, — aQ 
Q2 = $2 + 49,, (5.19) 


Aoo 5.20 
NG JO) ee 


According to (5.15), the conserved current is 


J, = —(0,01)02 + (0,62)9 4. (5.21) 
In terms of the complex fields defined as 
l ; 
p= 2 ($: + iġ2) 
ġ* = a — 12), (5.22) 


the Langrangian of (5.17) may be written 
L = (6,6*)(6") — u°($*6) — 1(6* 6)? (5.23) 
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which is invariant under the U(1) transformation 


d > d' =e" (5.24) 
giving rise to the conserved current 
J, = (0,0*)@ — (6,0)0*]. (5.25) 


We note that ¢, and @, (or ¢ and ¢*) are degenerate in mass because of 
the O(2), or U(1), symmetry. 


Example 2. Jsospin symmetry SU(2). We now consider the simple example of 
non-Abelian symmetry. Let @ be an isodoublet 


at sa 


A 
L = (OPNH) — (bb) -5 O'O (5.27) 


The Lagrange density given by 


is invariant under the infinitesimal isospin rotation 
qs: 
1 sog l 
i > Qi = 9 + ie z OF (5.28) 


where the t’s are the standard Pauli matrices. The conserved isospin current 
is given by 


—i 
Ji = y plig; — $18 0,)). (5.29) 
The time-components Jĝ have a simple form 


a =l a a 
ps > (Oop! t3.0; = pith aop) 


—1 


where 7; is the canonical momentum conjugate to ¢;. Using the canonical 
commutators 


[7;(x, t), O(x’, t)] = —1i6,,0° (x — x’), (5.31) 
we can show (see eqn (5.37) below) that the charges defined by 


QO? = [arse a=1,2,3 


satisfy the commutation relations of SU(2) symmetry 


LQ,, Q, | a légpcQe- (5.32) 


This means that the Q,s are the generators of the SU(2) symmetry. Again, the 
1, Pz fields have the same mass because of SU(2) symmetry. 
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Current algebra 


In the above we see that the exact symmetries lead to conserved current and 
their charges generate the group algebra of symmetry transformations. These 
simple commutation relations are useful for classifying particle states. For 
instance, the SU(2) transformations in Example 2 give rise to Q%, a = 1, 2, 3 
which are generators of isospin SU(2) symmetry (5.32). As we have discussed 
in §§4.2 and 4.4, they transform a particle into others within a given isospin 
multiplet. We shall demonstrate below that commutation relations such as 
(5.32) will hold even in the presence of symmetry-breaking terms. Consider 
the Lagrangian 


LG=aL+fL, (5.33) 


where & is invariant under the symmetry group G while Y, is not. Under 
the infinitesimal transformation (5.10) we can still define the current J% as in 
(5.15) but it will no longer be conserved and the charge defined as 


OL 
will not be time-independent. However the factor 6/6(0°¢;) is still the 
canonical momentum conjugate to ġ; even in the presence of Y, 


OL 
(== 5.3 
1%) = F505) (5.35) 
and satisfies the canonical commutation relation at equal time 
[ 7;(x, t), p(y, t) | = —id°(x 7 y) 0;;- (5.36) 


From this we can, without knowing the explicit form of the symmetry- 
breaking term ¥,, calculate the (equal-time) commutator of the charges, 
by using the identity [AB, CD] = A[B, C]D — C[D, A]B when 
[A,C] =[D, B] =0 


[O%(x), O°(t)] = — fa x d°*yLm(x, OX t), MY, Dtudly, 1) 


_ [ars d>y(7;(x, t) ti, LO (x, t), 71, (Y, t) ti (y, t) 


F 7, (Y, tth Enx, t), oly, t) jio, t)) 


= [Pint DIE, Jup, t). 


Or, using (5.11), we have 
LOA, Q°(t)] = iC Qe). (5.37) 


Thus even though the Q%(t)s change with time, at any given instant ¢, the 
commutation relations of the group algebra will still be satisfied. These 
relations are usually referred to as charge algebra. 
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Example. Broken symmetries of the free SU(3) quark model. Here we are 
concerned with the flavour SU(3) group (not colour!). The quark fields are in 
the triplet representation 


qi (x) u(x) 
q(x) = [ qzx) | = | d(x) (5.38) 
q3(x) s(x) 
with the transformation properties 
qi > q; = q; + 18°(4°/2);;q;» at x | (5.39) 
where the A’s are the eight Gell-Mann matrices 
B a _ fot 7 (5.40) 
The f%°s are the SU(3) structure constants. We have the Lagrangian 
SS ee wa a (5.41) 
with 
Lo = iqy"0,q (5.42) 
and 
ZL, = m ūu + mdd + mss. (5.43) 


Lo is SU(3)-invariant while Y, is not. The currents associated with the SU(3) 
transformation are given by 


Vix) = G(x)y,(A2/2)q(x). (5.44) 
The charges defined by 


OX(t) = | o(x) d°x 
will satisfy the SU(3) algebra 


LOC), OO] = if QA) (5.45) 
as a consequence of the canonical commutation relation 
{qai (X, t), any: t)} = 6 ;j0up0° (X A y) (5.46) 


where i, j are the flavour indices and «, p are the Dirac indices. To have exact 
SU(3) symmetry we actually need m, = ma = m,. In the Y, = 0 limit, ZY 
is invariant under transformations of a group larger than SU(3). Besides the 
transformation of (5.39), & is also unchanged under the axial 
transformation 


qi > q; = q; + 1B°(4°/2);;Y5qj» pl. (5.47) 
The corresponding currents are given by 


Ap) = GA), s9(x) (5.48) 
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which are axial-vector currents. Thus even in the presence of the symmetry- 
breaking term Y,, we can define the axial charge O°" 


Q5) = | A$ (x) d3x. (5.49) 


Together with the vector charge ỌQ°(x), the axial charges generate the 
following equal-time commutation relations 


[O*(t), Q°(t)] = if” Q(t) 
[O%(t), Q” A] = if hle) 
[a°%(t), QA] = if” Qe). (5.50) 
These commutation relations correspond to the chiral SU(3), x SU(3)p 
algebra. To see this, we form left-handed and right-handed charges defined 
by 
Qi = (0° — Q”*) (5.51a) 
R = AQ" + 0%). (5.51b) 


Eqn (5.50) may then be written as 
[Ot(s), QLCO] = if” Qi) 
[OR(), ORCO] = if” QRA) 
LOE), Or(t)] = 0. (5.52) 


Thus the Qfs generate the SU(3), algebra while the Qgs generate the SU(3)r 
algebra. 

One can extend the charge algebra (5.37) by considering the equal-time 
commutators of the charges and their currents. With exactly the same 
calculation as that leading to (5.37) we can show that 


LOA, J5(x, t)] = iC*"J5(x, t). (5.53) 


Then from Lorentz covariance, we can include the other components of the 
currents 


LOA), Ja, D] = iC” Jx, t). (5.54) 
Similarly we can go further than (5.53) and have 
[JO(x, t), Joly, D] = iC™Jo(x, t)ô*(x — y). (5.55) 


These relations, and similar extensions of (5.50), are called current algebra, 
see eqn (5.80) below. 

If one tries to include spatial components in the current algebra (5.55), one 
encounters additional terms which vanish upon spatial integration. For 
example, 


[Jo(x, £), Jily, D] = iC” Jx, Nd°(x — y) 


ô 
+ $%(x) — ô?(x — y) (5.56) 
7 ( May, (x-y 
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where S#?(x) is some operator depending on the explicit form of J?(x). This 
term vanishes upon integration over space 


S(x) le 6°(x — y) d?y = 0 (5.57) 
J 


so it will not modify the charge—current algebra of (5.54). Terms of this type 
are called Schwinger terms (Schwinger 1959). A simple argument will show 
that they cannot vanish in general. Consider the simplest case of the U(1) 
symmetry where there is no need of group indices and the structure constants 
vanish. If we assume the absence of Schwinger terms, the commutator (5.56) 
becomes 


[ Jo(x, t), Jy, t)] = 0 (5.58) 
which implies 
[ Jo(x, £), 0; J;(y, t)] = 0. (5.59) 
From current conservation “J, = 0, we obtain 
[ Jo(x, t), OoJoly, t)] = 0. (5.60) 


Taking the vacuum expectation value and inserting a complete set of energy 
elgenstates, we have 


COLIC, £), Fooly, HJI = $ (KO|Jo(x, t)ln><nldoJoly, t)10> 
— COlooJoly, Hina Jox, DIO) 
—j > (eP ~- Ea e7 (XY) 


x E,|O|JoO)In>I?. (5.61) 


Thus in the limit x — y, eqn (5.60) would imply that 
È E,|<0|Jo(0)\n>|? = 0. (5.62) 


From the positivity of energy we must conclude that 
<0|J,(0)|”z> =0 for all jn). (5.63) 


Thus, we have J, = 0 identically and the relation (5.60) is trivial. Therefore, 
we must have a nonvanishing Schwinger term. 

We should also note that the free quark model in fact has two more U(1) 
symmetries. The first U(1) symmetry corresponds to Z (eqn (5.41)) being 
invariant under the common phase change for each of the quark fields 


qi(x) > eq; (x) (5.64) 
with the conserved (baryon-number) current 


T(x) ~ (x); (x). (5.65) 
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The second U(1) symmetry corresponds to Y (eqn (5.42)) being invariant 


under 
q(x) > e?’5q;(x) (5.66) 
with the (partially) conserved (‘axial baryon number’) current 
Ji(x) ~ Gilx)yp7sqi(X). (5.67) 
The problems associated with this axial U(1) symmetry will be discussed in 
§16.3. 


5.2 Symmetry currents as physical currents 


As we studied in the previous chapter, symmetry groups are of great 
importance in particle classification. Now we see in field theoretical studies 
that these symmetry currents (Noether currents) and charges satisfy definite 
commutation relations that are valid even in the presence of symmetry- 
breaking terms. Another important result of field theoretical studies is that 
these symmetry currents are just the physical currents appearing in 
electromagnetic and weak interactions, i.e. the same Noether currents, or 
some linear combinations thereof, appear in the interaction Lagrangian. 
Thus current algebra, which represents symmetries of the strong interaction, 
can be directly tested in electromagnetic or weak-interaction processes 
involving hadrons. For reviews of applications of current algebra see Adler 
and Dashen (1968) and de Alfaro et al. (1973). 


Electromagnetic currents 


The most familiar physical current is the electromagnetic current JẸ (x) 
which is coupled to the photon field A ,(x) in the interaction Lagrangian by 


G™ = eJ™A? (5.68) 


where e is the electromagnetic charge coupling constant. We can decompose 
the current into leptonic and hadronic parts 


HJR + IG. (5.69) 

The leptonic current can be written directly in terms of the charged lepton 
fields 

a = —by,e — Aya +... (5.70) 


where the fermion field operators are denoted by their particle names. Since 
the leptons do not have strong interactions and the electromagnetic 
interaction can be treated perturbatively, we can directly measure J;7 in 
physical processes. In this respect the hadronic electromagnetic current Jj; iS 
quite different. It cannot be written directly in terms of hadronic fields such 
as n, K, N, etc. because they are not elementary constituents. While we can 
express J}; in terms of the quark fields, 


hd. = say ,u — 3dy,d = 48/8 Bee > (5.71) 
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it cannot be used in the same way as the leptonic current. We do not know 
the hadronic wavefunction in terms of the quark fields and quarks have 
strong interactions which must be taken into account to all orders. Until 
this can be done we have to rely on the experimental measurement to learn 
the structure of the matrix elements of this current. On the other hand, the 
electromagnetic current is also a symmetry current. In fact it is conserved by 
all known interactions. The hadronic charge operator 


OF = [oe d?x (5.72) 
obeys the Gell-Mann-Nishijima relation (see §4.4) 
Y 
em _ T Ra 
h 3 + 5 
This implies a similar relation for the corresponding currents, 
m= J} +5}. (5.73) 
In the quark model this corresponds to 
fo = ĝuu — 4d'd —4s's +... 
= (utu — dd) + {utu + dtd — 2sts) +... 
= J3 + 5J6. (5.74) 


Weak currents 


In weak interactions the currents play a similar role toJ $% in electromagnetic 
interaction. We shall see (cf. Chapter 11) that these two interactions are 
‘unified’ in modern gauge theories and that the weak and electromagnetic 
currents are members of one multiplet; hence they really have similar 
theoretical status. In this chapter we shall restrict our discussion to the 
charged weak current J,—to the part of J, that does not bear any of the new 
quantum numbers: ‘Charm’, ‘bottom’, etc. Similarly to (5.68) it is coupled 
to the charged intermediate vector boson (IVB) field W, in the interaction as 


LY = gJ,W? +he. (5.75) 


where g is the coupling constant. From this we have the effective low-energy 
Lagrangian for a current-current interaction 


Gr 
La = —— JIJ? + h.c. 5.76 
where g?°/M}, = G,/./2, as the massive IVB propagator, contributes the M2, 
mass factor in the denominator. Gp œ 107° is the Fermi constant measured 
in units of inverse proton mass squared. The weak current J, can also be 
separated into leptonic and hadronic parts 


Jia Jig sh (5.77) 
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with 
J? = vy (l — yse + V, — ys t+... (5.78) 


where v.(x) and v,(x) are field operators for the neutrino fields. The hadronic 
current can be written in the V—A form 


Ji = [(V} + iV4) — (A+ + i44)] cos 0, 
+ [(V4 + iV2) — (44 + iA4)] sin 0, (5.79) 


where 6, ~ 0.25 is the Cabibbo angle (Cabibbo 1963). The subscripts on the 
vector and axial vector operators are the flavour SU(3) octet indices. The 
various selection rules and symmetry relations implied by the SU(3) and 
SU(2) transformation properties of hadronic currents are well tested in semi- 
leptonic weak processes. For example, the strangeness-conserving vector 
current Vj + iV% and its conjugate V+ — iV4 are partners of the isovector 
electromagnetic current V4 in an isospin triplet and the corresponding 
charges will form the generators of the isospin SU(2) subgroup of SU(3). 
These isospin currents are of course approximately conserved. This is called 
the conserved vector current (CVC) hypothesis (Feynman and Gell-Mann 
1958). Thus from our knowledge of the electromagnetic-current matrix 
elements (form factors) we can predict the strangeness-conserving weak 
vector-current form factor by isospin rotations. Similarly the weak form 
factors of the strangeness-changing vector currents can be fixed by SU(3) 
rotations since all these vector currents are members of the same SU(3) 
octet, etc. Furthermore, these vector and axial vector currents are postulated 
to satisfy the SU(3), x SU(3), algebra (Gell-Mann 1964a) 


[Vialx, t), Vey, t)] = ifanc Ve (x, t)6°(x — y) 
[Vax t), AY, D] = ifaveAe (x, t)6°(x — y) 
[Aa (X, t), A5 (Y, £)] = ifar Ve, t) (x — y). (5.80) 


We note that these relations in which the left-hand sides are quadratic in the 
currents while the right-hand sides are linear will give rise to non-linear 
constraints among currents. Thus the normalizations of the currents are fixed 
by these non-linear commutation relations. 

A great triumph of the quark model of hadrons is its successful and simple 
explanation of all the above symmetry features of the weak hadronic current. 
Very much like the leptonic current in (5.78), the hadronic weak current in 
(5.79) can be written directly in terms of quark fields 


Ji = ūyŻ(1 — y,)d cos 6, + ūyŻ(1 — y,)s sin 0, (5.81) 


and, as we have seen in §5.1, the SU(3), x SU(3)p algebra (5.80) is also 
satisfied in a free quark model. The key problem of course remains of 
finding a fully interacting theory of the quarks in which the strong interaction 
naturally has this approximate global SU(3), x SU(3)r chiral symmetry. 
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Current algebra sum rule 


As an illustration of the type of results that can be obtained from the current 
algebra of (5.80) we will derive the Adler’s sum rule for neutrino scattering 
(Adler 1966). 


v (k) I” (k’) 


N(p) Xn 
Fic. 5.1. 
Consider (Fig. 5.1) the neutrino scattering off a nucleon target producing 


a charged lepton /~ and some hadron state X, (n particles with total 
momentum p,) 


v(k) + N(p) > I (k') + X(p,). (5.82) 
Define 
q=k—k' 
v=p'q4/M (5.83) 


where M is the nucleon mass. In the lab-frame 
Pu = (M, 0, 0, 0), k, = (E, k), k, r= (E’, k’), 
we have 


0 
q = —4EE' sin’ 5 


E ee (5.84) 


where 0 is the angle between k and k’, and the energy is high enough so that 
we can make the approximation of taking the charged lepton mass m, to be 
zero. From (5.76) and (5.78) the amplitude for this process can be written 


Ty = s nk’, Ay — ys)uk, AXKalJilp, o) (5.85) 


and the unpolarized differential cross-section as 


do™ = 11 1 @k IT EA 
( 


n = W 2M 2E (2n)22k) A+ | Omp, 
«5D ITPRO k +p -k =p) (580) 
oAd' 


where o, A, and J’ are spin labels of nucleon, initial and final lepton, res- 
pectively, and 
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If we sum over all possible hadronic final states, we obtain the inclusive cross- 
section 


d?o G2 
yw 5.87 
dig7|dv 32nE2.—* on) 


with the leptonic tensor 
lap = EK Y — yÉ —Ys5)] 
= 8(k,k, + kikg — KK 'Gag + i&agyok k?) (5.88) 


and the hadronic tensor 


n d i 
Ws (P, q) = = =) 2 ll la] 


4M o n 
x n o|Jip(0)in><n| Ina) p, o> (27) 8t (p, — p — 4) 


= Ves |S <p, AJig(y)Jna(O)lp, o> e”. (5.89) 


Note that W$} is non-zero only for qọ = E — E’ > 0. Since it is a second- 
rank tensor depending only on momenta p and q, it can be written as 
Wo = —WY gag + WS Dip p/M* — iW eagra pg? M? 
+ W2)qudp/M? + W3'(Didp + Ppda)/M? 
+iW?(P.dp — Peda)/M? (5.90) 


where the Ws are Lorentz-invariant functions of g? and v of (5.84), called 
structure functions. The cross-section in (5.87) is now 


d*o” Gi/(E 0 0 (E+ E’) 5 
oa 2w 2 Ww) 2 bA "we |. 
dig?| dv — -$(=)) ii S eg ae 7 
(5.91) 
The WY s,s do not appear in the m, = 0 limit. 
Similarly for anti-neutrino scattering 
v+N>l* +X, (5.92) 
we have 
do  GE/(E : 6 E+E) + E’) .. 
IWA He + WẸ c (¥) 
digidv 2r EE) a u + M in? WS J 
(5.93) 


where the structure functions W{” are defined by 
W(P, q) = L <p, o| Jng Y) J ilO) p, o> 
P, 4M £ In P, hg y P, 


= —W gag + WP Papp/M? — iW? eqpy5p"9?/M? 
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W?d44p/M? + WẸ Pade + Poqa)/M? 


+ come — p5q,)/M?. (5.94) 


Again they are non-zero only for qo > 0. We can alter (5.94) slightly using 
translational invariance 


WB P> d) -wll L et p, al Jye) Jia(— yp, 0> 


l y [S ep, alIa OIP, o). (5.95) 
2n 
Now consider the tensor W,, defined by 


WP: 4) = wah |x ep, al[Jig(y), Ja (Olp, o>. (5.96) 


When this is compared to (5.89) and (5.95) we have 
Walp, 9) = WYO, q) for qo>0 
= —WY(p, —q) for qo <0. (5.97) 


To derive the sum rule, we take Wo, and integrate over qo 


| Woolp, q) dgo = | (Wop, q) — Woodly. 9) dao 


o ~s 


I en 
= gy È | eve * <p, olL, 0), MOP, o>. 
(5.98) 


This equal-time commutator can be evaluated using the current algebra of 
(5.80). The simplest way to calculate this is to use the fact that these 
commutation relations are also satisfied in the free-quark model (see §5.1) 
where the current is given by (5.81). Using the canonical anti-commutation 
relations 


{gily, 0), 4;(0)} = 8° (y)ô;; (5.99) 


where the indices are those of the Dirac matrix space as well as the flavour 
space of (5.38), we have 


[Jho(y.0), Jro(0)] = [(cos 0.d'(y, 0) + sin 6,s'(y, 0))(1 — ys)ucy, 0), 
u'(0)(1 — y5)(cos 0,d(0) + sin 6,s(0))] 
= 2(cos 6,d'(y, 0) + sin 6,s'(y, 0))(1 — ys) 
x {u(y, 0), u'(0)}(cos 6,.d(0) + sin 6,s(0)) 
— 2u'(0)(1 — y5)(cos? 0,{d'(y, 0), d(0)} 
+ sin’ 6.{s'(y, 0), s(0)})u(y, 0) 
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= 26°(y)(cos? 6,d"(0)(1 — y5)d(0) 
+ sin? 6,8'(0)(1 — ys)s(0) 
+ sin 0, cos 6,(d‘(0)(1 — y5)s(0)+ s'(0)(1 — y5)d(0)) 
— ut(0)(1 — ys)u(0)) 
= —6°(y)[4 cos? 0,(V§(0) — Ao(0)) 
+ sin? 6,(3(V9(0) — Ao(0)) + 2(Va(0) — Ao(0))) 


+ 4 sin 6, cos 6.(V8(0) — A8(0))] (5.100) 
where 
Vi = (utu + d'd — 2s's)/3 Ao = (uty;u + d'y.d — 2s'y,s)/3 
Vo = (u'u — d'd)/2 Ao = (u'ysu — d'y,d)/2 
V$ = (d's + s'd)/2 Ae = (d'y,s + s'y5d)/2. 
(5.101) 


Since V$ and Aŝ are strangeness-changing operators, their matrix elements 
vanish when taken between nuclear states. Also averaging over nucleon spin, 
we have 


LE <p, ojAilp o> =0 for all i. (5.102) 


Thus (5.98) becomes 


0 


| [WSP 4) — WSd(p, a)] dado = 
(0) 


= [4T cos? 0, + (3Y + 273) sin? 0] (5.103) 
where T, and Y are the isospin and hypercharge of the nucleon state 
52 <p. olV 8p, 0> = 2T ap. 
D <p, al Volp, o> = 2Ypo. (5.104) 


We now proceed to express the left-hand side of (5.103) in terms of the 
structure functions 


| Wo dgo = |r- WY + WY(po/MY + WI'(do/MY 
0 
+ W$'(Poqo/M*)] dao. (3.105) 


A judicial choice of reference frame will simplify this equation. Instead of the 
nucleon rest frame, we will take the infinite-momentum frame (Fubini and 
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Furlan 1965) in which the nucleon has infinite momentum orthogonal to q 
pl > œ «6Uwith p-q=0. (5.106) 

Then in this frame, 

Po = (P? + M*)'? ~ |p| > œ 

v = p'q/M = poqo/M 

q = qo -9° = (VM/po)’ — q > -0° (5.107) 

and the largest term on the right-hand side of (5.105) is WY 


o0 


Jm | W's), dqo = = | dvW(q7, v) (5.108) 
p| > œ 
(0) (0) 


where we have assumed that it is valid to intercharge the limit and the 
integration. Thus the sum rule in (5.103) becomes 


| [Wg v) — WY’, v)] dv = 47; cos? 0, + (3Y + 273) sin? 0, 
0 


2 / 2 cos? 0, + 4sin? 0, for a proton target (5.109) 


—2 cos? 0, + 2 sin? 6, for a neutron target. 


This is the Adler current-algebra sum rule for neutrino scattering. We remark 
that even though it is derived in the infinite-momentum frame, the final result 
expressed in terms of Lorentz invariants should be true in any given frame. It 
has the notable feature that the g*-dependence of the left-hand side gets 
‘integrated away’. This sum rule can be used for any target with appropriate 
T, and Y quantum numbers on the right-hand side. An extension to include 
other additive flavour quantum rules (beyond strangeness) can be carried 
out in a straightforward fashion. 


5.3 Spontaneous breaking of global symmetry, the Goldstone 
theorem 


The SU(3), x SU(3)ęk algebra (5.80) generated by the various physical 
currents suggests that we have a strong-interaction Hamiltonian 


H = H + AH, (5.110) 


where Ho is invariant under SU(3), x SU(3)g transformations and H, is not. 
In the limit of 2 = 0, all generators of the chiral algebra are conserved. We 
would expect particles to form degenerate multiplets corresponding to 
irreducible representations of the group SU(3), x SU(3)r. For example, the 
octet pseudoscalar mesons should be accompanied by an octet of scalar 
mesons, and the J? = (1/2)* baryons should have partners with opposite 
parities. But in reality there is no evidence for this larger multiplet structure. 
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This leads to the idea that SU(3), x SU(3)p symmetry is spontaneously 
broken and that this symmetry of Ho is not realized by the particle spectrum. 


Non-invariance of the ground state as a symmetry-breaking condition 
Let U be an element of the symmetry group which leaves Ho invariant. Then 


and it connects states that form an irreducible representation (basis ) of the 
group 


U|A> = |B}. (5.112) 
From (5.111) and (5.112) it follows immediately that 
E, = <A|HolA> 
= (B\|H,|B> = Ep. (5.113) 


Thus the symmetry of the Hamiltonian Họ is manifest in the degeneracies of 
the energy eigenstates corresponding to the irreducible representations of the 
symmetry group. However implicit in the statement of (5.112) and hence 
(5.113) is the invariance of the ground state under symmetry transformation. 
Since |A> and |B» must be related to the ground state |0 through some 
appropriate creation operators @, and o, 


|4> = 40>,  |B> = a0 (5.114) 


and 

Udb,U' = oz, (5.115) 
eqn (5.112) follows only if 

U|O» = |0>. (5.116) 


When (5.116) is not satisfied, this vitiates (5.113) and the usual symmetry 
consequence of degenerate energy levels. Such a situation is commonly 
referred to as a spontaneous symmetry breakdown. However, it must be 
emphasized that, even though the symmetry is not manifest in the degenerate 
energy levels, there are still symmetry relations coming from the fact the 
Hamiltonian or the Lagrangian is still invariant under the symmetry 
transformation. 


Ferromagnetism as an example of spontaneous symmetry breakdown 


This lack of degeneracy in particle spectra in a symmetry theory may come as 
a surprise, but there are a number of familiar situations where the ground 
state is not a symmetric state. A well-known example is ferromagnetism near 
the Curie temperature Te. For T > Te, all the dipoles are randomly oriented; 
the ground state is rotationally invariant. For T < Te, all the dipoles are 
aligned in some arbitrary direction (spontaneous magnetization) and the 
rotational symmetry is hidden. Consider the description of this phenomenon 
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by the Ginzburg—Landau theory (1950). For temperatures near the Curie 
point, the magnetization M is expected to be small; a power series expansion 
of the free energy density can be made, with higher powers of M being 
neglected 


u(M) = (ôM? + V(M) (5.117) 
where 
V(M) = «,(T)(M-M) + «,(M-M)’. (5.118) 
The energy densities u and V are clearly rotationally symmetric. We have 
assumed a slowly varying field and kept only the first derivatives. The 
(M-M)? term in (5.118), with a positive coefficient a, >0, is included 
because, at the Curie point, «, vanishes 
a,=a(T—T,) with« >Q. 
Since the (ô;M}? term is non-negative, to obtain the ground-state magnetiza- 
tion we simply minimize V (M) 
ôV/ôðM; = 0 (5.119) 
or 
M(a, + 20,.M-:M) = 0. 


For T > Te (i.e. «, > 0), the solution is at M = 0. For T < Te (i.e. a, < 0) 
M = 0 is a local maximum and the minimization fixes the magnitude of 
magnetization (the order parameter) to be 


IM] = (—a,/20,)1/2. (5.120) 


But its direction is unspecified by the theory itself. The ground state, having 
M in some definite direction, is one member of this infinitely degenerate set; 
it is fixed by the boundary condition and is not rotationally symmetric. For 
temperatures below the Curie point the rotational symmetry of the magnet is 
spontaneously broken. Thus the symmetry-breaking condition is the non- 
invariance of the vacuum (ground state) 


UO) #|0>. (5.121) 


For U = exp(ie*Q*), where the fs are the continuous group parameters, 
(5.121) can be expressed by the statement that the symmetry charge does not 
annihilate the vacuum 


0°10) + 0. (5.122) 


An equivalent statement to (5.121) and (5.122) is that certain field operators 
have nonvanishing vacuum expectation values 


<01G,|0> # 0. (5.123) 


This can be seen easily as a symmetry transformation of the type (5.115) or 
(5.10) may be represented in terms of generators as 


LOs, ;] = iti; j (5.124) 
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where ff, is the appropriate representation matrix. Thus (5.122) implies the 
existence of at least some nonvanishing matrix element ¢0|@,;|0>. We should 
also remark that translational invariance of the vacuum state leads to the 
conclusion that these matrix elements are space—time-independent constants, 


<0|P(x)|0> = <0] e? *H(0) e7? “10> 
= <0lo(0)|0>. (5.125) 


The Goldstone theorem 


Spontaneous breakdown of a continuous symmetry implies the existence of 
massless spinless particles. The study of this connection was initiated by 
Nambu (1960; Nambu and Jona-Lasinio 196la, b) and subsequently the 
proofs, with various degrees of rigour and generality, were provided by 
Goldstone (1961) and others (Goldstone, Salam, and Weinberg 1962). Such 
scalar particles are referred to as Nambu—Goldstone bosons or simply as 
Goldstone bosons. In the following we shall first give a formal proof 
(Guralnik et al. 1968). This is followed by a number of illustrative examples. 

Any continuous symmetry of the Lagrangian, by Noether’s theorem, 
implies the existence of a conserved current 


a,J"(x) = 0. (5.126) 


Normally we can convert this into the statement of the charge being a 
constant of motion dQ(t)/dt = 0 with Q(t) = f d°xJo(x, t). However, with 
spontaneous symmetry breakdown (5.123), Q(t) is not well defined because 
of the poor convergence property of the field operator in the integrand. Even 
the weak limit, corresponding to the matrix element <0|Q7(t)|O>, does not 
exist. The translational invariance property of the vacuum state leads to the 
result 


<0|97(1)/0> = | d*x<0| Jo(0)Q(0)|0> 


which diverges because the integrand is nonvanishing and x-independent. 
The nonexistence of Q does not really matter since in actual calculation only 
the commutator of Q need ever appear. For the transformation of some 
generic field operator @(x), we have 


bx) > G'(x) = eH (x) e? 
p(x) + ieLQ, pD] +.... 


Here we shall only assume that the commutator exists and formulate the 
proof entirely in terms of its properties. Current conservation (5.126) implies 
that 


0 = | d3x["J,(x, À, 6(0)] 


= 0° [axt (x, £), 6(0)] + [as ‘[J(x, t), $(0)]. 
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For a large enough surface and thus for large space-like separations the second 
term on the right-hand side vanishes. Hence 


d 
q; LEO: oC) = 0. (5.127) 


This commutator being some combination of fields, if it has nonvanishing 
vacuum expectation value 


COLOCA), G)]I0> =n # 0, (5.128) 


we say that the symmetry is spontaneously broken. After inserting a 
complete set of intermediate states and using a translation operator, (5.128) 
may be written 


È (27)°5>(p,){<0|Jo(0)|n> <nlol(0)0> eis! 
— COIP LAJ (ONO) e=} = y. (5.129) 


The right-hand side is nonvanishing and time-independent because of (5.128) 
and (5.127). Since the positive and negative frequency parts do not mutually 
cancel, (5.129) can be satisfied only if there exists an intermediate state for 
which 


E,=0 forp,=0. 


Thus, it is a massless state (the Goldstone boson). This particle will have the 
property that 


<n|p(0)|0> # 0, <O|Jo(O)|n> 4 0. (5.130) 


Thus it can be connected to vacuum by the current J (0) or the operator G(0). 
This theorem is true independently of perturbation theory. We will illustrate 
it in a few examples. 


Discrete symmetry case 


Goldstone bosons are not expected to be present in the discrete symmetry 
case. Our purpose is to illustrate the circumstance in which the symmetry- 
breaking condition (5.123) can take place. The Lagrange density given by 


l l À 
= 5 (0,0) -3P — 56" (5.131) 


has the discrete symmetry 


eer ETE 4 (5.132) 


Since do¢ is the momentum field conjugate to ġ, the Hamiltonian density is 
given by 


H = 5 (dob)? + 5VO)? +5028? +5 6". 
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Thus the classical potential (energy density) may be identified as 


1 
u(o) = 5 (Vo)? + Vi) 
with 
V(o) = sep + Eagt. (5.133) 


Since the (Vġ)? term is non-negative, the minimum of u(ġ) will have the 
property V@ = 0 with the constant value @ given by minimizing V(@) of 
(5.133) directly. Since the coupling 4 is positive (so that energy is bound from 
below), V(¢) for the two possible cases of u? > 0 and u? < 0 is shown in Fig. 
5.2. For u? < 0, the ground-state field is nonvanishing 


b= +(-H’/d)*”. (5.134) 


V ($) V$) 


u? >0 u 2 <0 
Fic. 5.2. Effective potential with the quadratic term having different signs. 


In quantum field language, the ground-state is the vacuum and the classical 
ground-state fields in (5.134) correspond to the vacuum expectation values 
(VEV) of the field operator @ 


<O|P|O> = v 
with 
v= +(—p?/d)*. (5.135) 


The two possible values in (5.135) correspond to the two possible vacua. One 
can choose either one (and only one) to build the theory. Either choice, say 
v = +(—p?/A)*, clearly breaks the original reflection symmetry ¢ > —¢@ of 
the theory. (Since the Fock spaces built on the two possible vacua are 
mutually orthogonal, it is not meaningful to build a theory based on some 
superposition of the two vacua.) This is the broken symmetry condition. 

Since the symmetry in this case is discrete, we do not expect massless 
Goldstone bosons. To verify this we need to consider small oscillations 
around the true vacuum. Thus define a new quantum field variable with zero 
VEV. In terms of this ‘shifted field’ 


p=o-v, (5.136) 
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the Lagrangian density becomes 
l A 
? =; (0p Y — (=o — Avg”? — 4 pt. (5.137) 


d’ describes a particle with mass (— 27). 


Abelian symmetry case 


The Lagrange density given by 
| | 
=5 (0,0)? + 5 (0,2)? — V(o? + n°?) (5.138) 
with 
= u À 
V(o* + n°) = ar a (0? + n°) + ri (o* + x7) (5.139) 
has (continuous) U(1), i.e. O(2), symmetry 
(") m (z) _ ( cosa sin *)(?) (5.140) 
T T —sina cosa/\z 


The extremum of the potential V is determined by 


OV 
— =o[—y? + Alo? + n”)] = 0 
ôC 
ôV 
== = nlp? + Mo? +22)] = 0. (5.141) 
For yp? > 0, the minimum is at 
o? + 1? =v? (5.142) 
with 
v = (u2/A)?. (5.143) 


A graphic representation of the potential is given in Fig. 5.3. The minima 
consist of points on a circle with radius v in the (ø, n) plane. They are related 


Fic. 5.3. The potential function of eqn (5.139) for u? > 0. 
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to each other through O(2) rotations. Hence they are all equivalent and there 
are an infinite number of degenerate vacua. Any point on this circle may be 
chosen as the true vacuum. We can pick, for example, 


<Olo(0) =v,  <O|n|0) = 0. (5.144) 


Thus the O(2) symmetry is broken by the vacuum state. 
To find the particle spectrum in perturbation theory we consider small 
oscillations around the true minimum and define a shifted field 


Go =o-—D. (5.145) 
The Lagrange density may be written 


l 


À 
L = -[(0,0') + (0 nY] — Ko’? — Avo’ (o? + n?) — 7 (o'* + 27)?. (5.146) 


N | 


There is no quadratic term in the n-field and that in the o’ field has the right 
sign. Therefore, m is the massless Goldstone boson and o’ is a particle with 
mass (2y”)?. In fact it is easy to see this pictorially in Fig. 5.3. Small 
oscillations around any point of the minimum circle may be decomposed into 
the radial and polar angle components. The polar-angle oscillations are 
along an equipotential trajectory and it does not cost any energy and hence 
corresponds to a massless particle. With our choice of vacuum (5.144) the 
polar angle oscillation is along the x direction—hence mn becomes the 
Goldstone boson. 

We shall also examine these features more formally through the 
Goldstone theorem and make connection to the proof of eqns (5.128) and 
(5.130). The conserved current generated by the U(1) symmetry (5.140) is 
given by 


J (x) = [(0,2)o — (0,0)7] (5.147) 


with the associated charge being 


O= [næ d°x = [imo — (dga)n] d?x. (5.148) 
Using the canonical commutation relations 
[d n(x, t), n(y, D] = —id°(x — y) (5.149a) 
[d,0(x, t), o(y, X] = —id3(x — y), (5.149b) 
we can derive 
LO, n(0)] = —10(0) (5.150a) 
LQ, o(0)] = in(0). (5.150b) 


According to the formal proof (eqns (5.128)-(5.130)) of the Goldstone 
theorem, the symmetry-breaking condition (5.144) implies the existence of a 
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massless particle state, in this case the quanta of the n-field with 
<n|n(0)|0> 4 0 
<0|Jo(0)|z> 4 0. (5.151) 


To see more explicitly that the right-hand side is nonvanishing we note that 
(5.129) for our case takes on the form 


> (22)*6°(p,){ <0] Jo(0)|2><n|2(0)|0> e7 
— <O|z(0)|n><n| Jo(0)|0D e'”""} = —iv. (5.152) 
The only contribution on the left-hand side is from the massless m state: 
ijay = |x>. Thus 
d°p ., 
Po 0°(p){<O| Jo(O)In(p) ><x(p)|2(0)|0 
— <O|x(0)|n(p)><2(pP)|Jo(0)|0>} = —iv (5.153) 


which is satisfied for 
CO Jo(0)|n(p)> =ivpo (5.154) 


if the normalization condition <O|n(0)|x(p)> = 1 is taken. We note that 
manifest covariance requires that 


<O| J,,(0)|2(p)> = ivp. (5.155) 
Thus the matrix element of the current divergence is 
<0|6"J,,(0)|t(p)> = vm? (5.156) 
and current conservation implies that either 
v = <Ojo(0)|0> = 0 (5.157) 
or 
m, = 0. (5.158) 


Thus in this example, the nonvanishing VEV, v = <O|o|0>, is related to the 
pion decay constant (see eqn (5.178) below). 


Non-Abelian symmetry case: the SU(2), x SU(2), © model 


Consider a theory (Schwinger 1958; Polkinghorne 1958; Gell-Mann and 
Levy 1960) with the following fields: isotriplet of pions m = (1,, 72, 73), an 
isoscalar o-field and an isodoublet of nucleons N = (p, n). The Lagrangian 
given by 
L =5[(6,0)? + (6,n)7] + Niy"d,N 
+ gN(o + it: ny;)N — V(o? + 17) (5.159) 
with 


2 
= 
V(o2 + n?) = = (0? + n?) +F (0? +n?) (5.160) 
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is invariant under the SU(2) transformations 
o-o0 =0 


ton =N+axn 


N > N' =N- iaz N (5.161) 


for a; « 1 with the conserved currents given by 


Ja = Nays N + entm for a = 1, 2,3. (5.162) 


u 


The SU(2) generators are 
Q* = | a(x) d°x. (5.163) 


This Lagrangian (5.159) is also invariant under the axial SU(2) 
transformations 


o-oo =o+ f'n 


n — rT =n — po 
N > N' =N +iB-5y5N (5.164) 


with the conserved currents given by 


z qT! 
Ar = Nyys > N + (ô on" — (ô, 7°) (5.165) 
and 
Q05 = | a(x) d3x. (5.166) 


These charges generate the SU(2), x SU(2)p algebra 
[O*, QP] = ie™Q" 
LQ", O°] = iet™Q® 
[057 Q5] = ie Q". (5.167) 


The spontaneous symmetry breakdown will happen for u? >0 and the 
minimum of the potential is at 


o? +r? =v? with v = (y7/A)?. (5.168) 
We can choose <0|x|0> = 0 and 
<O|a|0> = v. (5.169) 


With the shifted field defined as o’ = o — v in V(o? + nê); we can easily 
check that ms are the massless Goldstone bosons. Following a similar 
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procedure to that outlined in the previous example, we can work out 
commutators such as 


(0%, n] = —iod”. (5.170) 


And the choice of (5.169) implies that axial charges 0°“ do not annihilate the 
vacuum; in fact 


COASCO # 0. (5.171) 


Thus the SU(2), x SU(2)z symmetry is broken spontaneously into the 
SU(2) symmetry generated by the charges Q“ of (5.163) because 


O70> =0 fora=1, 2,3. (5.172) 


We note that in the original Lagrangian (5.159) there is no nucleon mass term 
because am, NN term would not be invariant under the axial transformation 
(5.164). However the chirally symmetric Yukawa coupling of 
gN(o + it: my;)N generates a mass term for the nucleon after spontaneous 
symmetry breakdown 


gN(o + it: my;)N > guNN + gN(o’ + it: ny;)N (5.173) 
and the isodoublet nucleon mass 1s 
My = gv. (5.174) 


The meson masses are (isoscalar) m, = ./2u and (isotriplet) m, = 0. Thus the 
symmetry of the Lagrangian SU(2), x SU(2)p is not realized in the particle 
spectrum which displays only the isospin SU(2) symmetry. If we have u? < 0, 
then the symmetry is not hidden: o and r will be degenerate in mass and form 
the (2, 2) irreducible representation of the SU(2), x SU(2)p group. 


5.4 PCAC and soft pion theorems 


The symmetry of the Lagrangian is always reflected in the algebra of 
currents. But in spontaneous symmetry breakdown the particle spectrum 
only realizes that portion of the symmetry which is also respected by the 
ground state. Thus in the SU(3), x SU(3)p algebra of electromagnetic and 
weak currents (see §5.2) we want the symmetry to be broken spontaneously in 
such a way that the vacuum is only SU(3) symmetric. The Goldstone 
theorem then implies that there must be eight massless pseudoscalar mesons 
associated with the spontaneously broken axial charges 0°", a= 1,..., 8. 
Clearly in reality we do not have such massless particles but eight relatively 
light mesons, m, K, and n. We then conclude that the flavour 
SU(3), x SU(3)g symmetry must also be broken explicitly and the masses of 
the 0” mesons reflect this chiral symmetry breaking (Dashen 1969) 


H = Hy + dH (5.175a) 


where o is SU(3), x SU(3)g invariant and #’ is not. Also the pion 
isotriplet being much lighter than the Ks and n suggests that we can further 
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decompose the symmetry-breaking Hamiltonian into two terms 
AH” = AHi + ARa (5.175b) 


where #, is SU(2) x SU(2)r invariant and 4, > 42. Thus, SU(2), x SU(2)r 
is a much better symmetry than SU(3), x SU(3)rk. An example is the free- 
quark model where chiral symmetry is broken explicitly by the quark mass 
term (5.43) with m >m, ma and we identify A4., =m5s and 
A,H, = m, ūu + mdd. Thus the pions are expected to have masses propor- 
tional to the nonstrange quark masses, and the kaons and eta meson to have 
masses proportional to the strange quark mass (more on this in the next 
section). In this section we shall derive a number of soft pion theorems which 
hold in the chiral symmetry limit (4, = 0) with pions taken to be massless 
particles. In the next section soft meson theorems sensitive to the structure of 
the symmetry-breaking term 4 W + 42 will be studied. 


PCAC 


As the discussion in §5.3 indicates, the Goldstone bosons 1°(x) have direct 
couplings to the broken axial charges Q° and currents 44 as in (5.171): 


CO|ALO)IW'(p)> =if"p, a,b =1,2,3 (5.176) 


where f® is some nonzero constant. If we assume that the SU(2) isospin 
symmetry is unbroken, it may be written as 


S (5.177) 


where f, is the pion decay constant measured in m* >/* +v. 
Experimentally we have f, ~ 93 MeV. Taking the divergence of the axial- 
vector current, we have 


COJ ASCOT Cp) = Pfam. (5.178) 


Thus, if 2, = 0 in (5.175), the SU(2), x SU(2)r symmetry in the Hamiltonian 
is exact, and 


XAT = 0 (5.179) 


which implies that m2 = 0 in (5.178), as required by the Goldstone theorem. 
However if the symmetry is explicitly broken , A, # 0, we can rewrite (5.178) 


<00" A On’ (p)> = fmz <O O) (p)> (5.180) 
where œ“ is the pion field operator with the normalization 
COPO p) = 6”. 
The generalization of (5.180) into an operator relation 
"AS = fmo" g= 1;2.3 (5.181) 


is known as the partially conserved axial-vector current (PCAC) hypothesis 
(Nambu 1960; Chou 1961; Gell-Mann and Levy 1960). One would think that 
such a relation which connects the weak currents Aj and the strong 
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interacting pion fields ° should have a host of experimental implications. In 
practice, in order to translate this PCAC hypothesis into relations connecting 
physically measurable quantities, additional assumptions need to be made as 
we shall see in the following example. 


Low energy theorems with one soft pion 


(1) Goldberger—Treiman relation. Consider the matrix element of the weak 
axial vector current between nucleon states (p(k’)|(Aj, + iAj)|n(k)> which is 
measured in the neutron B-decays. From Lorentz covariance we have 


<p(k’)\(Ay + iAZ)In(k)> = a,(k’)Ly,9594(47) + Qp¥sha(q?)]un(k) 
(5.182) 


where q = k — k’ is the momentum transfer between n and p. The form 
factors g4, h, are functions of the invariant q*. Experimentally we have 
ga(0) ~ 1.26. The current divergence has the matrix element 


p(k’) 0"(A;, + iA In(k)> = 
iti, (k’)y 5Un(k) [2g a(q") + 97h4(q?)] (5.183) 
where my is the nucleon mass. The PCAS hypothesis of (5.181) with 


bt = (+i? (5.184) 
yields 
(p(k) OA! + iADInEY = Vfim plk ot In(k)> 
2 
Ie gong? iT kysk) (5.185) 
—q z7 Mı 


where g,nn(q*) is the nNN vertex function. The physical pion-nucleon 
coupling constant g,nn 1s defined as 


InNN = Jann (MŽ) (5.186) 


with an experimental value of (Ga/nn/4 14.6. Comparing (5.185) with 
(5.183) we have 


fn 2 2 2 2 
grt He Iunn(1°) = 2MngAlq") + ThA). (5.187) 
If we set q? = 0 in this equation, we have 

InGrnn(0) = Mg 4(0). (5.188) 


This relates the nucleon axial vector coupling g,(0) to the mNN vertex at an 
off-mass-shell point g,nn(0). In order to convert this into a physical relation 
we need to make an additional assumption that the variation g,nn(q’) from 
q? = 0 to q? = m? is small, i.e. that g,nn(q7) is a ‘smooth’ function 


Jxnn(O) > Jnnn(m2) = 9JxNN: (5.189) 
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Only then do we obtain the Goldberger—Treiman relation (1958) 
SeGann = Mng A(O) (5.190) 


which is satisfied within 10 per cent for the above-quoted measurement 
values. 

We should remark that only the form factor h,(q7) has a pion pole term 
corresponding to the diagram in Fig. 5.4 


V 2h 


m- q? J 2gmnn(q")- (5.191) 


pion pole of h,(q*) = 


At the pion mass shell point q? = m2, eqn (5.187) yields a trivial identity 
fe Innn(Mz) = 2fINZ Jann(Mz). 


This example illustrates that the additional smoothness assumption is 
needed in order to obtain relations among physical quantities from the 
PCAC hypothesis. In particular we have to extrapolate the pion field from 
the off-shell point q? = 0 to the on-shell point q? = m2. Since numerically the 
pion mass m2 is rather small on the hadronic scale, this extrapolation is 
believed to cause only a small error. The Goldberger—Treiman relation 
(5.190) serves as a measure of the typical accuracy of this type of 
extrapolation. This means that if we extend the PCAC hypothesis to other 
pseudoscalar mesons, the Ks and n, the extrapolation must be over a much 
larger kinematical region (from 0 to mx or m2). Thus the kaon and eta meson 
PCAC relations are not expected to be as good as those for the pions. 

One can also derive the Goldberger—Treiman relation in the chiral 
SU(2), x SU(2)p limit (A, = 0 in (5.175)). In this symmetric limit the 
currents are conserved 


“A, = 0 (5.192) 
which modifies (5.183) to read 
2Mngalq°) + GP hag’) = 0 (5.193) 
and pions are massless Goldstone bosons. Hence the pole term of h,(q) is at 
q? = 0. From (5.191) we have 
m h,(q’) = age (5.194) 
92> 


which, when combined with (5.193) leads again to (5.190). Thus the deviation 
from the Goldberger-Treiman relation measures the chiral symmetry- 
breaking term 4,.%,. Similarly the derivation from the kaon and eta meson 
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PCAC relations, which we expect to be larger, measured the chiral SU(3), 
x SU(3),-breaking term 1,4, + AH. 

(2) Adler’s consistency condition on the nN scattering amplitude. Consider 
the pion—nucleon scattering amplitude 


<m"(q2)N(2)In"(91)N(p1)> = i(2n)* 64(p, + qı — P2 — G2) THR (5.195) 
which has the invariant decomposition 


(qı + 42) 


> B® lupa): 


Ta, = alpa] 4” +y: 


The invariant amplitudes A and B are functions of the usual Mandelstam 
variables s and t or the more symmetric variables 


v = qı : (Pı + P2)/2M = q2- (Pı + p2)/2M 
VRB = —q1 -q2 /2M. 


We note that v > 0, vg — 0 for either of the pions becoming soft, q} —> 0 or 
gq, > 0. One easily works out the pole-term, i.e. single-nucleon term, 
contribution to the invariant amplitudes. It can be shown that the 
combination of invariant amplitudes (which is just the forward scattering for 


qi = 92) 

T=A+vB (5.196) 
is nonsingular for either g, —> 0 and/or q, > 0. Furthermore we have the 
isospin-even and -odd amplitudes 

A® = ov A) + 44, Ty}A”); 


similarly for B+ and T”. 

To derive the single soft pion theorem we use the standard reduction 
formula (see, for example, Bjorken and Drell 1965) for the one-pion field 
operator in (5.195) 


T%, = i(—g2 + m23<N(p3)16%(0)n"(q,)N(P)> 
-BOR HM) np, AONO) (5.197) 


T N 


where we have used the PCAC relation (5.181). Taking the q, — 0 limit, the 
nonsingular amplitude of (5.196) T (v, vg, 97, 43) vanishes 


T‘*(0, 0, m2, 0) = 0. (5.198a) 


This is Adler’s PCAC consistency condition (Adler 1965a). That TO? is zero in 
this limit is trivial since it is odd under crossing and we expect it to be 
proportional to the variable v. Similarly, we also have the condition, 


T‘*(0, 0, 0, m2) = 0 (5.198b) 
when taking the q, > 0 limit. 
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Low-energy theorems involving two soft pions 
For matrix elements involving more than one current, low-energy theorems 
may be derived if we combine PCAC and current algebra. In fact the physics 
and the mathematical procedure to be used are similar to those of the 
familiar soft-photon theorems which reflects the U(1) gauge invariance of 
charge conservation (see, for example, Low 1954). 

Consider the double divergence of a time-ordered product of two axial 
vector currents 


0KO T (Ai(x)A(y)) = O%05(A(X%0 — Yo)An(x)AR(Y) 

+ (yo — Xo) AR(y)AR(x)) 

= A(A(Xo — Yo) A(x) 6°A2(y) 
+ O(¥o — Xo) 8*A3(y)A™(x) 
— ô(xo — Yo)Ag(x)A0(V) 
+ ôl Yo — Xo)Ao(y)An(x)) 

= T ("Ap (x) 6°A3(y)) 
+ ô(xo — Yo)LAG(x), 6*AR(y)] 
— ôk 5(Xo — Yo) LAT), AoC]. (5-199) 


Sandwiching this identity between nucleon states and taking the Fourier 
transform 


[ats dty e11 ay ey 
we obtain 


qia? fas e" *<N(p2)|T(Ai(x)44(0))IN(P1)> 


2 fats e'{ N(p,)|T(0"A4(x) 6*44(0))IN(p1)) 


—igt<N(P2)16(%o)LAG(0), An(x)IIN(p1)> 
+ <N(p2)|5(Xo)[A0(x), 6*A2(0)]IN(P1)>} (5.200) 


where we have used translational invariance and factored out a (27z)* 
O(p; + qı — P2 — q2). This relation between the matrix elements of currents 
and the matrix elements of divergences is an example of the Ward identities. It 
is the starting point for the derivation of low-energy theorems. The question 
of maintaining the Ward identities in higher-order perturbations will be dis- 
cussed in §§6.1 and 6.2. 

PCAC implies that the first term on the right-hand side of (5.200) is the 
(nucleon) matrix element of a time-ordered product of two-pion operators, 
i.e., it is the nN scattering amplitude. The second term can be evaluated 
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from the SU(2), x SU(2), current algebra of (5.80) 
(Xo) [AG(0), ASC] = —i ôx) V(x). (5.201) 


In principle there is also the contribution from the Schwinger term in this 
commutator. But as it turns out, the time-ordered product defined in (5.199) 
is not covariant because of the singularities in the product T(A%(x)A}(y)) as 
Xo > Yo and an extra term should be added to make it covariant. This extra 
term will cancel the Schwinger term in the commutator of (5.201) (see, for 
example, Adler and Dashen 1968). The net result is that if one uses the usual 
time-ordered product one does not have to be concerned about the 
Schwinger term in the derivation of the Ward identities. 

The third term on the right-hand side of (5.200) is an equal-time 
commutator of a current and a divergence. This commutator, called the o- 
term, is not governed by the current algebra and it depends on the symmetry- 
breaking terms (5.175b). In the following application we shall eventually take 
the limit of p, > p2 = Pp, qı > q2 = q > Q. In such a kinematical configura- 
tion the c-term can be shown on general grounds to be symmetric in the 
indices a and b. To see this, write 


am oN (p, q) = oF =i [ats 5(Xo)<N(p)ILAG(X, xo), 6*A3(0, Xo) IIN(P)>. 
(5.202) 


Using translational invariance and changing variables x to — x, we can write 
(5.202) as 


ON =i | d°x<N(p)I[AG(0, xo), 07-A3(X, Xo) JIN(P)> 


= j | d3x<N(p)I[A3(x, xo), 6°43(0, Xo) JIN(p)> (5.203) 


where we have used the fact that the spatial divergence vanishes upon 
integration over all space. We then have 


ON = iĝo [axana Xo), Ao(0, xo)JIN(p)> 


al | d®x<N(p)I[00A46(x, Xo), Ao(0, Xo) IIN(p)> (5.204) 


or 


ON — ON = iĝo | d°x<N(p)ILAG(x, xo), Ao(0, Xo) IIN(P)>. 


The commutator on the right-hand side will give an isospin charge after the 
integration over space is performed, and it is time-independent if we neglect 
isospin-breaking effects. Thus we have 


o% = ght (5.205) 
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It is also clear from the form of eqn (5.203) that the o-term, being pro- 
portional to 0*A,, represents chiral symmetry-breaking effects. 


Adler—Weisberger relation. To derive the low-energy theorems for nN 
amplitudes involving two soft pions we reduce out both of the pions in 
(5.195) and apply the PCAC formula (5.181) 


Tin = i [ats et (qi — mx )(q2 — mz) 
x (NDITI "6 ODINCD)> 
= iat — miN = mmr fE? | ate 


x <N(p2)|T(0" Ag (x) 6°43(0))|N(p1)>- (5.206) 


Similarly consider the weak axial-vector current amplitude 
(27) ôt (pı + 41 — P2 — qa) TÈ 
= | dtx d4y e" * e" N(p,)T(Aa(x)A5(y))IN(P1)> - (5.207) 


The amplitudes of (5.206) and (5.207) are related by the Ward identity 
(5.200). With the forward scattering kinematics p; = p, =p and q; = 
q = q, we have 


q" T$ = —i(q? — m?) ?m$ f TSh + ivit", 1°]/2 — ioÑ (p, q) 
(5.208) 


where we have used the definitions in (5.206), (5.207), and (5.202). The 
commutation relation in (5.201) implies that the second term on the right- 
hand side of (5.200) takes on the form 


—iq" [ats e *<N(p)6(Xo)L48(0), An) IIN(p)> 


abc 


= e“qhu(p)y cul p)/2 
= 2p: qe t*/2 = —iv[t", 1]/2. (5.209) 


The o-term, as we have demonstrated above, is symmetric in a, b. Since the 
pion has isospin 1, the isospin symmetric f-channel state must be 0 or 2. Only 
the isospin-zero state can contribute here since the nucleon has isospin 1/2. 
Thus we write 

of? = 5 an. (5.210) 


The left-hand side of (5.208) is quite complicated as it involves contributions 
from all possible intermediate states that can couple to the nucleon through 
the axial vector currents. This can be simplified by taking the low-energy 
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limit q, — 0 so that the only surviving terms in T ab are those singular in this 
limit. It is easy to see that such terms correspond to the one-nucleon pole 
diagrams in Fig. (5.5). 


Fic. 5.5. 


(qT E) poe = 2192 {[t?, t7]v — 6%q7}(v? — maq’)/(q* — 4v’) 


(5.211) 
where g, is defined in (5.182). Since q? « v = p'q for small q, we have 
(g'q'T®) poe = igiv[2*, 1/2. (5.212) 
Thus in the low-energy limit, the Ward identity (5.208) becomes 
igtv[t*, 1]/2 = —if2T%, + iv[t’, 1?]/2 — idan. (5.213) 


The forward amplitude T,, is just the combination A + vB of (5.196). We 
have the following soft-pion theorems for the isospin odd and even nN 
amplitudes T‘*)(v, vg, 97, 93): 


lim v~!7(v, 0, 0, 0) = (1 — g2)/f? (5.214) 


v0 


and 
T'*(0, 0, 0, 0) = —a,/f?. (5.215) 


To make contact with physical amplitudes (q? = q} = m2) we can extrapolate 

the result to the physical threshold (scattering length) at v = vo = mm, 
2 

A m;/2, 


vo T (vo, —m;/2, mz, mz) = f1 (Q — 94) + O2) (5.216) 
T’ (Vo, —m;/2, Mz, Mz) = O(A2) (5.217) 


where we have used the fact that m? and the o-term are chiral symmetry- 
breaking effects and are of order O(A,) in (5.175b) (Weinberg 1966). 
Alternatively we can convert the low-energy theorem to sum rules by using 
dispersion relations. For example, since T‘~)(v, q°) is odd under v > —v, we 
can write an unsubtracted dispersion relation for v~ +T Xv, 0), 


T(v,0) 2 f Im TOV’, 0) dv’ 
ee eee ee (5.218) 


v ve—y 


vo 
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Then setting v = 0 and using (5.214), we get 


o0 


[97 .2 |= T” (v, 0) dv 
es 


(5.219) 
T 


vo 


V 


We may use the Goldberger-Treiman relation (5.190) to eliminate fy 


o0 


l 2m4 { Im TO?(v, 0) dv 
+==1+-> | a (5.220) 
GA TNGaNN V 


vo 
If we make the smoothness assumption 
Im TO?!(v, 0) ~ Im TO ?!(v, m2), 


we can relate the on-shell amplitude Im 7‘ (v, m2) to the nN cross-section 
from the optical theorem 


Im TO (v, mz) = vol O) = i O) — of (v)]. (5.221) 


The sum rule in (5.220) becomes 


1 2m | dv[or.P(v) — Sio? (V)] 
aa A e 
ga TNOaNN V 
vo 

This is the Adler-Weisberger relation (Adler 1965b; Weisberger 1966). 
Using experimental values for the mp total cross-sections we get the weak 
axial nucleon coupling g,~1.24 which agrees quite well with the 
experimental value 1.259 + 0.017. 

The isospin-even amplitude is related to the chiral symmetry-breaking o- 
term, which will be examined in the next section. 


(5.222) 


5.5 Pattern of chiral symmetry breaking 


Soft-pion theorems such as the Goldberger—Treiman relation and the Adler- 
Weisberger sum rule are exact chiral SU(2), x SU(2)p symmetric results. 
They are not sensitive to the structure of the symmetry-breaking terms in 
(5.175b). On the other hand, the o-term represents chiral symmetry-breaking 
effects. Consider this commutator of axial current with its divergence 
appearing in (5.199) and (5.203), taken between some general hadronic states 
of momentum p 


Cap =i fa x< B(p)ILA0(x, 0), 6°46(0)]la(p)> 


Doa | dy B(p)ILO™, LH, 0), 43(0)la(p)) 


= [aeoo *, [Æ (0), Ao(y, 0) ]la()> 


<B(p)ILQ°*, LQ”, #0). (5.223) 
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Thus the o-term is simply a double commutator of the Hamiltonian density 
H with two axial charges. If the chiral symmetry-breaking term is absent in 
H, the axial charges are conserved. They commute with the Hamiltonian and 
the o commutator vanishes. Thus we can replace # in (5.223) by the chiral 
symmetry-breaking term (5.175b). 


Measuring the nucleon and vacuum o-terms 


We have already seen (5.215) that the isospin-even nN scattering amplitude 
in the soft-pion limit is proportional to the o-commutator between the 
nucleon states. One must be careful in relating this result for 7‘*(0, 0, 0, 0) 
to the on-shell q? = qå = m? amplitude since this extrapolation involves 
a correction term of the same order as the o-term itself. However a systematic 
expansion in the chiral SU(2), x SU(2)g symmetry-breaking parameter /, of 
(5.175b) is possible (Cheng and Dashen 1971) 


T'”0, 0, m2, m?) = T 0, 0, 0, 0) 
aO SOL? 
+ m; EA + m; T 


Using Adler’s consistency conditions (5.198) such as 


+ O(m®. (5.224) 


eT) 
T'*(0, 0, m2, 0) = T'*(0, 0, 0, 0) + m? 


+ O(m = 0, 
ôqî 


we have 
T‘*)(0, 0, m2, m2) = —T*(0, 0, 0,0) + O(mź) 
= oy/f2 + O(m). (5.225) 


In this expansion in powers of the symmetry-breaking parameter À, (i.e. m2), 
we have ignored any possible non-analyticity problem (Li and Pagels 1971). 
It should be noted that even for the on-shell amplitude the kinematic point 
v = vg = Ois still outside the physical region. However the amplitude value at 
this point can be reliably extrapolated from the physical quantities via 
ordinary dispersion relations. 

So far we have concentrated on the SU(2), x SU(2)p chiral symmetry. The 
generalization to SU(3), x SU(3)p is straightforward. The PCAC relation 
for octet axial-vector currents reads 


HA = fmo a=1,2,...,8 (5.226) 


where the ġ°s are the field operators for the octet pseudoscalar mesons. The 
generalized Goldberger—Treiman relations and soft-meson theorems for 
meson-—baryon scattering amplitudes can be derived in a similar fashion. 

We can also obtain more low-energy theorems in the soft-meson limit by 
considering other matrix elements of the currents. In particular, from (5.226) 
we have 


COJ MARI PAK) = Santa Sa- (5.227) 
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Using the reduction formula and PCAC, we can write this equation as 
i(m? — k?) 
fomi 
: 2 k? l 
— a) fik, | d*x<0|T(0"A4(0) A3(x))|0> e7* * 

fomi 


ÔabM fa = [ats e™ *£0|T(6" AZO) 6 A(x)) 10> 


= [ats e™™ *£0|6(xo)LAG(x), 0" A7,(0)]10) } . (5.228) 
The low-energy theorem is then 
Ô „m2 f2 =i | d*x<0|5(x)[AG(x), 64A2(0)]10> = oè (5.229) 


where 


oo = COL", LQ”, #(0)]]10). 


Thus the pseudoscalar meson masses are related to the vacuum matrix 
elements of the o-term. (This relation (5.229) can also be derived by directly 
sandwiching eqn (5.199) between the vacuum states.) 


The (3, 3*) + (3*, 3) theory of chiral symmetry breaking 


The nucleon and vacuum matrix elements of the c-commutator are related to 
the experimentally measurable quantities through the relations in (5.225) and 
(5.229). We now need a theory of chiral symmetry breaking. What is the 
structure of the A#’ term in (5.175)? 

A simple possibility is that the chiral symmetry is broken by the quark 
mass term only 


AH’ = m ūu + midd + m5s (5.230) 
or 
Ai =m5s and 1,#, = m,ŭu + mdd (5.231) 


since the quark fields transform as 
qı = 2(1 — y5)q ~ (3, 0) 
dr = 2(1 + y5)q ~ (0, 3) 


of the SU(3), x SU(3)p group. AH’, being of the form Ldr + Grq_, trans- 
forms as a member of the (3, 3*) + (3*, 3) representation. This is the theory of 
chiral symmetry-breaking proposed by Gell-Mann, Oakes, and Renner (1968), 
and by Glashow and Weinberg (1968). In group-theoretical language we say 


RK = Colo + C33 + Cgllg (5.232) 


where the u,„s are a set of scalar densities. In terms of the quark fields (5.38) 
and Gell-Mann matrices they have the representation 


u,=GAq, i=0,1,2,...,8 (5.233) 


5.5 Pattern of chiral symmetry breaking 163 
with 

Ag = (2/3)71. (5.234) 
Similarly define the pseudoscalar density as 

v; = —iqd;y5q. (5.235) 


With the representation of u;, v; in (5.233) and (5.235) and Q*, O° in (5.44) 
and (5.48) one can then work out their commutators using the canonical anti- 
commutation relations for quark fields, 


{qi(x, t), ply, t)} = Oap b°(x = y), (5.236) 


where «, 6 label the Dirac and flavour indices. We shall illustrate this 


procedure for the case [Q°%(1), u°(x, t)]. Suppressing all space-time depen- 
dences we have 


ake 7 
fa z 70759; aa | = [qidg, diqs](Ays)ep(A°y®),3/2 


= (—aqlai{qg, qs} + ai{dg, attas — al{ai, ds}d¢ 
T {q}, aq pGs)AVs)ap(Ary”)»a/ 2 


= q'[A*y5, A?y]q/2 = —qys{d*, A°}q/2. (5.237) 
We can define a totally symmetric symbol d,,, by 
A 4? l ab abc A, 

55 a0 l+d a (5.238) 


The nonvanishing elements are 


_1 
diig = dong = dz3g — dggs = 1/ V3, dagg = dssg = desg = drg = —3(3 2), 
d344 TE d355 E —d366 = —d377 = di46 = dis7 = —dy47 = dz56 =e 1/2. 


(5.239) 
Furthermore, if we supplement this with (5.234) and 
doas =\(3)* ĝa, (5.240) 
we have 
(Xo) [Q2(Xo), U;(X, xo)] = —idajV (0) 84x). (5.241a) 
Similarly, 
5(Xo)LQa(Xo), v(x, Xo)] = idajxth,(0) 5°(x) (5.241b) 
5(xo)LQalXo), uj(X, Xo)] = ifajate(0) 5°(x) (5.241c) 
5(Xo)LQa(Xo)s ¥j(X, X0)] = ajxre(0) ô$ (x) (5.241d) 


with a=1,...,8 and j,k =0,1,...,8. The fis are the usual SU(3) 
structure constants with fabo = 0. 
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In the quark model language 
Uy = ($)*?(au + dd + 5s) 
ug = (3)*(au + dd — 25s) 


u, = (au — dd). (5.242) 
The coefficients in (5.232) correspond to quark masses 

Co = (m, + mg + m, 

0o77 /6 u d S 

1 /m + ma 
Cg = PVE) 7 R m, 
l 

C3 = 3 (m, azi ma). (5.243) 


In the following we shall first assume isospin invariance; hence m, = m, or 
c, = 0. The question of isospin violation due to m, 4 m4 will be taken up at 
the end of this section 

AH’ = Cog + Cgltg. (5.244) 


Thus SU(3) symmetry breaking is due entirely to the Cgug term. 


Current quark masses 


The double commutator of the o-term can be calculated using (5.241a) and 
(5.241b). In actual computation it is simpler if one takes 1’ and the F's to 
be 3 x 3 matrices and computes the anticommutator of (5.237) directly. One 
finds (eqn (5.229)) 


2 2_ (M + Ma) 


fim: > <0jūu + ddjO» 
PARAS (m, T m,) 2 = 
fem = ie <Oluu + ss|O» 
a (m, Be Ma) om 4m, is 
fam, = = —- COltiu + ddjO> + -= Ojas|0. (5.245) 


Since the SU(3) symmetry is not spontaneously broken, we will take the 
vacuum to be SU(3)-symmetric, i.e. 


<0lūul0> = <Oldd|O) = <O|ss|0) = p3, (5.246) 


and, from the definition of decay constants (5.176) and conditions for 
spontaneous chiral symmetry-breaking ((eqns (5.169) and (5.154)), we have 


Ie =I =i, =f. (5.247) 
Besides recovering the Gell-Mann—Okubo mass relation 4mg = 3m? + m2, 
we obtain the ratio of quark masses 
m, + mg m2 1 
ae aaa, 5.248 
2m, 2mz — m? 25 ( ) 
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The pseudoscalar masses suggest that the strange quark has a much larger 
mass than the masses of the non-strange quarks u and d. This means that the 
SU(2), x SU(2)gz symmetry is a much better symmetry than SU(3), 
x SU(3)g. In terms of the parameters in (5.244), this means that 


Taas (5.249) 


is not far from the SU(2), x SU(2)ęk symmetric value of —2?. 


nN o-term 


The nN o-term for the symmetry-breaking Hamiltonian (5.244) can be 
similarly computed in the quark model 


On = 4(m, + m)<N|ūu + dd|N). (5.250) 

At this stage of theoretical development we still do not have a reliable 

method for calculating such a matrix element. One possible way to estimate 
this quantity is to invoke the Zweig rule (Cheng 1976) 

<N|ssi[N> ~ 0 (5.251) 


since the nucleon is supposed to contain little strange-quark component (see 
the discussion following eqn (7.85)). Eqn (5.250) may then be written 


On = 4(m, T maX Nļūu + dd = 25s|N> 


3(m, F ma) 


ee ers <N]cguelN) (5.252) 


where we have used eqns (5.242) and (5.243). The nucleon mass shift due to 
the SU(3)-breaking Hamiltonian Cgug is 


Amy = <N|cgug|N> (5.253) 
which can be related to the general baryon octet mass splittings by 
Am, = <B\cgus|B> = a tr(BugB) + B tr(BBug). (5.254) 


On the right-hand side we have written the baryon octet (and ug) in 3 x 3 
matrices (see (4.157)). That there are two terms on the right-hand side reflects 
the fact that there are two 8s in the product 8 x 8 (eqn (4.124)); hence two 
SU(3) scalars are contained in the product 8 x 8 x 8. Reading off eqn 
(5.254) the coefficients « and p are then related to the baryon mass shifts as 


My = Mo + (a — 28) 
Ms = Mo + (a + P) 
Mz = Mo — (2a — P) 
m, = Mo — (a + B) (5.255) 


where we have absorbed an inessential common factor of ./3 into « and $. 
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Besides recovering the Gell-Mann—Okubo mass formula eqn (4.166) we can 
identify 


<Nicgug|N> = (a — 28) = m, — mz. (5.256) 
Hence, from (5.252), 


3(m, + Ma) 


Sa = 2m, A sae 


ON 
For the ratio (5.248) we obtain on ~ 30 MeV. It is clear that there is quite a 
bit of uncertainty in this estimate, yet the nN o-term is one of the few places 
where one can probe the chiral symmetry-breaking pattern with any sort of 
reliability. 


m, # m, and isospin violation by the strong interaction 


To close this section we also examine the possibility of isospin symmetry 
breaking due to m, # ma. Since we already know that electromagnetic 
interaction violates this invariance, we must untangle these two separate 
mechanisms of isospin-breaking; the SU(3), x SU(3),-violating term may be 
written 


AH! = Hy + H, (5.258) 

with 
KH, = m ŭu + mdd + ms (5.259) 
H, = e | d4x T(J OD p(x) (5.260) 


where J”(x) is the electromagnetic current operator and D,„,(x) is the photon 
propagator. 

There are two contributions to the pseudoscalar meson masses: one 
coming from the o-term due to Hn, the other from the o-term due to #%,. 
Eqn (5.229) reads 


fam ô” = of, + of, (5.261) 
where 
om = COLO S, [Q°’, Hn J1/0> (3.262) 
oo, = COLOS, [0°’, #,]]|0>. (5.263) 
But 
[0", #,] =0 (5.264) 


for any electrically neutral 0°* operator. Thus (Dashen 1969) 
Foy(R°) = Go,(K°) = do,(n°) = 0. (5.265) 
Also from SU(3), 1.e., U-spin invariance, we have 


Co (T+) = 6o,(K*) = ys. (5.266) 
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After an entirely similar calculation to that which leads to (5.245) and using 
the SU(3) results (5.246) and (5.247), we have (Weinberg 1977) 


f?m?(n*) = (m, + mau? + u 
f?m? (n°) = (m, + mg)” 
f mK?) = (m, + mJ" + u, 
f?m?(K°) = (mg + mu" 
f?m?(n°) = Hm, + mg + 4m,)p?. (5.267) 
We obtain the (improved) Gell-Mann—Okubo relation, as well as 


m(n°) m, +m, 
m?(K°) m, +m 


(5.268) 


and 
m?(K*) — m’ (K?) — m*(n*) Ma 


m?(K°) — m2(K*) + m(n*) — 2m (n0) m, PRR 


Besides the quark-mass ratio of (5.248) we also obtain 
m/m, = 1.8. (5.270) 


This indicates that u and d quarks are actually not degenerate in mass. This 
helps us to resolve a number of longstanding difficulties in our picture of 
isospin invariance being broken only by electromagnetism: the sign puzzle of 
the proton—-neutron and K* — K° mass differences, and then — 32 problem. 
If one considers this isospin-violating decay n — 3r to proceed via a second- 
order virtual electromagnetic interaction, a straightforward current-algebra 
calculation then predicts that, in the SU(2), x SU(2)r chiral symmetric limit, 
it is strictly forbidden. (For further discussion see §16.3.) It has, of course, 
been well-known for a long time (Feynman and Speisman 1954) that this 
picture for isospin breaking produces the ‘wrong’ sign for the n—p mass 
difference. Also if electromagnetism is the only source of isospin symmetry- 
breaking, we will have (through eqns (5.265) and (5.266)) the Dashen sum rule 


m? (K+) — m2(K°) = m?2(n*) — m(n?) (5.271) 


which again yields a wrong sign for the K*—K®° mass difference. But the 
result in eqn (5.270) shows that there actually is a substantial isospin 
violation coming from the strong interaction itself. 

If m, and m; are so different, how do we account for the smallness of the 
observed isospin violation? This is possible if the u and d mass difference is 
comparable to m, and mg, themselves and they are all small on the typical 
strong-interaction scale. Indeed a modern-day calculation of the n—p mass 
difference (Gasser and Leutwyler 1975) gives 


m, = 4 MeV, ma = 7 MeV (5.272) 


and we now know (see Chapter 10) that the intrinsic strong-interaction 
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mass-energy scale is about 200 to 400 MeV. Thus SU(2) isotopic spin is a 

good symmetry because the chiral SU(2), x SU(2)p is a good symmetry! 
To a lesser extent a similar situation holds for the flavour SU(3) and chiral 

SU(3), x SU(3)p symmetries. From eqns (5.248) and (5.272) we have 


m, ~ 25(m, + mg)/2 ~ 130 MeV. (5.273) 


We should emphasize the quark masses we have been discussing are 
current-algebra quark masses, or current quark masses. They are the 
parameters of the chiral symmetry breaking. In quantum field theory (e.g. in 
the free quark model of (5.41)), they are the parameters appearing directly in 
the Lagrangian. These are different from the constituent quark masses m 
appearing as parameters in (nonrelativistic) bound-state calculations of 
hadrons: Mproton ~ 3M, M, ~ 2m. Since even in the full strong-interaction 
theory of quantum chromodynamics (QCD) we have not been able to solve 
the bound-state problem from first principles, the connection between these 
two types of quark masses has not been rigorously established. But, as a rule 
of thumb, the constituent masses m and current masses m; differ by a 
common constant 


m, =m, + Mo. (5.274) 


mo, being somehow related to the scale parameter of strong interactions, is of 
order 300 MeV. 


Global flavour symmetry of the strong interaction—a summary 


The discussion of this chapter leads us to expect that a viable theory of the 
strong interaction must be approximately chiral SU(3), x SU(3)g symmetric. 
The field theory of free quarks (5.41) indeed displays such a global symmetry. 
It must then be endowed with an interaction that formally (i.e. at the 
Lagrangian level) preserves this invariance. However the dynamics must be 
such that this chiral symmetry is spontaneously broken with the vacuum 
state having quark—antiquark condensate <O|qq|O> # 0, hence not being a 
chiral singlet. This dynamical breaking should be characterized by a 
momentum scale comparable to f,. It should be symmetric with respect to the 
diagonal subgroup SU(3), so that the hadron spectra and interactions 
exhibit the familiar SU(3) symmetry of Eightfold-Way and there is an octet of 
pseudoscalar Goldstone bosons. This chiral symmetry is also broken 
explicitly (hence the Goldstone bosons are not strictly massless) with a 
pattern consistent with the three light quark mass terms in the Lagrangian 
being the soft symmetry breaking terms. As we shall see in Chapter 10, the 
gauge theory of the strong interaction (QCD) has all the features compatible 
with these expectations. 


6 Renormalization and symmetry 


THE topics of renormalization and symmetry are closely related. The 
symmetry relations among Green’s functions are generally known as the 
Ward identities (Ward 1950; Takahashi 1957). In a theory with nontrivial 
symmetry, renormalizability depends critically on the cancellation of diver- 
gences from different sectors of the theory as enforced by the Ward identities. 
This is even more so for gauge theories where we often need to introduce 
spurious degrees of freedom (e.g., photon longitudinal polarization state, 
etc.), and one has to be certain that, through the use of Ward identities, these 
unphysical states are all cancelled in the physical S-matrix elements. On the 
other hand, one is also concerned with the effects of renormalization on the 
symmetries themselves. It is this latter aspect of the relation between 
renormalization and symmetry that we will concentrate on in this chapter. 

In §6.1 we first see how the vector-current Ward identity in the A¢* theory is 
maintained up to one-loop diagrams. However similar considerations in §6.2 
lead us to the conclusion that the QED Ward identities involving axial-vector 
currents are spoiled by renormalization, by the triangle fermion loops. There 
must be ‘anomalous terms’ in the axial-vector-current divergence equation. 
This allows us to derive the correct current-algebra low-energy theorem for 
the n° — 2y decay. 

The last two sections concern the relationship between spontaneous 
symmetry breakdown and renormalization. Again we concentrate on the 
effects of renormalization on symmetry. In §6.3 we present several topics 
illustrating the renormalization of theories with spontaneous symmetry 
breaking. Finally in §6.4 we show how the classical potential, which we have 
used repeatedly to study spontaneous symmetry breaking, can be regarded as 
the first term in a systematic expansion in powers of Planck’s constant and 
how in certain situations, radiative corrections themselves bring about 
spontaneous symmetry breakdown. 


6.1 The vector-current Ward identity and renormalization 


In eqn (5.200) we saw an example of Ward identities. These relations among 
Green’s functions follow from the symmetry properties of the Lagrangian 
(i.e., current conservation, charge commutator, etc.). They play a crucial role 
in the derivation of current-algebra low-energy theorems and, as we shall 
elaborate later on, they are essential in the renormalization programme of 
any theory with nontrivial symmetries. Thus it is important to check that 
these Ward identities are not spoiled by higher-order correction terms in 
perturbation theory. In this section we shall first see, in simple 4@* theory, 
how the vector-current Ward identity is maintained up to one-loop diagrams. 
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The vector-current Ward identity 


The Ad* theory of eqn (5.23) has U(1) symmetry with the conserved (vector) 


current of (5.25). The canonical commutation relation for the complex fields 
of (5.22) 


[3o (x, t), P(X’, t)] = —i5°(x — x’) (6.1) 
then leads to the commutators 
[Jo(x, £), Cx’, t] = iL0oP"(x, t), 6%’, NIEK, t) 
= 0°(x — x')Ọ(x, t) 
[Jo(x, t), f(x’, t)] = —8°(x — x')"(x, t). (6.2) 


Consider the three-point Green’s function given in Fig. 6.1 


pt+q 


p 


Fic. 6.1. The Green’s function of two scalar fields coupled to a vector current. 


G,(P; q) = [ats dty e~ ==? LO|T(T,() (1) 6"(0))10>. (6.3) 


We make the standard current-algebra manipulation 


q°G,(p,q) = —i [ats d4y eT?” ALOIT(J,(x)b(y)6"(0))10> 


-i | dx d¢y ef (OIT(*T,(x)G(y)61(0))10> 


+ COTE — Yo) Jo(x), $(y)16"0)10> 
+ CO|5(Xo)LJo(x), $1(0)1G(»))10>}. (6.4) 


The first term on the right-hand side vanishes because of current conserva- 
tion, 0*J, = 0, and the other terms can be simplified by using (6.2) 


q'G,(p, q) = —i [ats e'P+9*0|T((H(x) $"(0))|0> 
Ti fes e~? *O|T(P"(0)6(y))10>. (6.5) 
The right-hand side is just the propagators for the scalar field 


A(p) = | d4x e~? *<0|T(P(x)"(0))|0>. (6.6) 
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Eqn (6.5) may be written 


which is an example of vector-current Ward identities. 


Z, = 1 and Ward identities 


Here we digress briefly to illustrate an application of the Ward identities. 
They can be used to show that conserved current J,(x) is not renormalized 
as a composite operator. The identity given in (6.7) is for the unrenormal- 
ized fields which satisfy the canonical commutation relation (6.1). (For the 
renormalized fields there will be a factor of Z% + on the right-hand sides of 
the commutation relations.) In terms of the renormalized quantities, using 
the relation (2.167), 


Gi(p, 9) = Z5 ‘Z7 ‘Gp, q) 
A?(p) = Z5 * A(p). 
We can express the Ward identity (6.7) as 


—iZ,q"Gi(p, q) = A?(p + q) — A? (p). 


Since the right-hand side of this equation is cutoff-independent, Z, on the 
left-hand side must also be cutoff-independent, and we do not need any 
counter term to renormalize J,(x). In other words, the conserved current 
J,(x) is not renormalized as a composite operator, i.e., Z; = 1. Thus the 
Ward identity (6.7) also holds for the renormalized quantities: 


—ig"Gi(p, q) = A®(p + q) — AX(p). (6.8) 


From now on we will drop the superscript R in (6.8) with the understanding 
that these Green’s functions refer to the renormalized quantities. 

Such a nonrenormalization result holds for all sorts of conserved 
quantities, in Abelian (as this example shows) and in non-Abelian cases. 
Actually, for currents associated with non-Abelian symmetries, there is a 
direct way to understand this result. These currents must obey fundamental 
commutation relations such as eqn (5.55). The nonlinear nature of commu- 
tators fixes their normalizations so that no renormalization is possible. 


The vector Ward identity at the one-loop level 


It is instructive to see how the one-loop diagrams satisfy the Ward identity of 
(6.7). In terms of the amputated Green’s function I’,(p, q) and the 1PI self- 
energy & defined by (see eqns (2.48) and (2.24)), 


F(p, 4) = LiA(p + q)]~*G,(p, q)LiA(p)]~* 
[A(p)]~* =p? — vw? — X(p), (6.9) 
the Ward identity (6.7) takes the form 
iq’T (p,q) = (p+ 4)" — pP’ — X(p + q) + Xp). (6.10) 
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k+ 
p+q R 
(b) 
(c) 
(d) 


FIG. 6.2. Tree and one-loop diagrams for the vector current and scalar fields vertex function. 


For the contributions of I’,(p, q) Fig. 6.2(a) shows the vertex function in 
the tree approximation 


ig'T® (p, q) = ig'"(—)(2p + q), = (P + 4}? — p? (6.11) 


which is just the lowest-order Ward identity. 
Using dimensional regularization for the one-loop diagram (Fig. 6.2(b)), 
we have 


| =o fdk i i 
ig'T?(p, q) = iq" lone iA Peg (—i)(2k + q), cig 


‘d"k l l 
=i ie a aa 6.12 
| omy la +- u? k- | ie 
For n < 2, the first integral on the right-hand side is convergent and we can 
shift the integration variable k to k — q, to get 


d"k l 1 
iq'T®(p, q) = iÀ E ls = at = Q, (6.13) 


This will still be true when we analytically continue to n > 2, in particular to 
n = 4. The contribution of Fig. 6.2(c) 1s given by 


ighT(p, q) = ig'(—i)(2p + q), Pee ae [E({p + q) — £(0)] 


(6.14) 
where 


iA | d"k i 
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which is independent of the external momentum. We get 


X(p) = X(p) — 20) = 0. (6.16) 
Hence the right-hand side of (6.14) vanishes and we have 
ig’'T(p, q) = 0. (6.17) 
Similarly, we also have 
ig'T®(p, q) = 0. (6.18) 
Thus up to the one-loop order we have, from the sum of Fig. 6.2(a), (b), (c), 
and (d), 
ig'T (p, 4) = (p + 4° — p° (6.19) 


which is the Ward identity of (6.10) because of (6.16). 

We note that the above proof of the Ward identity (6.7) (or (6.10) in 
perturbation theory) involves two important ingredients: (i) the algebraic 
relation of (6.11); and (ii) the translation of the integration variable in going 
from (6.12) to (6.13). Generally, in order to maintain the Ward identity, 
which is the consequence of the symmetry, we must not choose a 
regularization scheme that will spoil the original symmetry. (It is clear that 
the dimensional regularization fulfils those requirements.) 


6.2 Axial-vector-current Ward identity anomaly and r° — 2y 


Following essentially the same steps as in §6.1 we shall see that the validity of 
the (axial) Ward identity is not automatic when there are fermions in the 
theory, even after the theory is regularized symmetrically. This is because 
certain one-loop diagrams introduce anomalous terms which prevent the 
Ward identities from reproducing themselves recursively at higher orders in 
the perturbative expansion. Such anomalies were discovered by Adler (1969, 
1970) and by Bell and Jackiw (1969) in their current-algebra studies. In the 
following we shall present an elementary introduction to this subject of ABJ 
anomalies. 


The tree-level Ward identities and current divergences from the equation of 
motion 


Consider the three-point functions in electrodynamics 
PyvalK1,k2,q) = i [ats dtx LO T(V (x1) V,(x2)A ,(0))|0> et 7 
(6.20) 
Tulki, ka, q) = i | d4x, dix COJ T(V, Vx) P(0))|0> et 6 
(6.21) 
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where V,,, A,, and P are the vector, axial vector, and pseudoscalar currents, 
respectively, 


V Ax) = W(x)y W(x), 
A(x) = (x,y (x), 
P(x) = W(x)ysw(x), (6.22) 
and 
q= k, +k. 


For the Ward identities relating 7,,,, and T,,,, we need the divergences of V, 
and A, which are calculated from the equation of motion 


V (x)=0 
OYA (x) = 2imP (x) (6.23) 


where m is the mass of the fermion field W(x). With an elementary application 
of current-algebra techniques such as 


eTO) = TOTI A0) + LJo(x), OY)] ôx — Yo) (6.24) 


for the current J,(x) and the local operator @(y) and with the knowledge that 
in our case the equal-time commutators vanish 


[Vo(x), Ao(y)] 0(%0 — Yo) = 9, (6.25) 
we can formally derive the following vector and axial-vector Ward identities 
KET wy, = KaT ava = 9 (6.26) 

and 
QT yy, = 2mT,,. (6.27) 


Anomalies arising from renormalization 


But when we calculate the lowest-order contributions to 7,,,, and T,,, (see 
Figs (6.3) and (6.4) we find that the Ward identities (6.26) and (6.27) are not 
satisfied 


f d*p i i i 
Fama =i [SFU te | yp Eee 
zS) e 
. f d*p i i i 
zio a eD ea a a A 
i Har 4" [pom (p—4)—m' a 


A & ae (6.29) 
uv 


6.2 Axial-vector-current Ward identity anomaly and n° => 2y 175 


ke k, 
Fic. 6.4. Lowest-order contributions to T„, of eqn (6.21). 


To check the Ward identities, in particular (6.27), we can use the relation 


Gs = Ys(P — q — m) + (p—m)ys + 2mys. (6.30) 
to find that 
İT yy, = 2mT,, + AY) + AY (6.31) 
with 


d*p i i 
Go einen ee SE eee, eee 
a T A] 


i l 
k) -m p-a) n) d 


dîêp i i 
A= ss E P 
äi i : L Cm" p-k) m” 


i i 

(p—k,)—m'*""(p—@)—m n) da 
If the integrals A®, vanish we have the Ward identity in (6.27). Superficially 
this appears to be the case. The two integrals in A“) cancel each other if we 
can shift the integration variable p to p + k, in the second term. Similarly the 
other pair of integrals in (6.32b) would cancel. But the integrals are linearly 
divergent and a translation of integration variable produces extra finite terms 
with Al) 4 0 and AQ # 0. This ruins the Ward identity. 
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Shift of integration variable for linearly divergent integrals 


It can easily be demonstrated in one dimension that a shift of integration 
variable may be illegitimate for a divergent integral (see, for example, Jackiw 
1972) 


(e0) 


A(a) = | dx[ f(x + a) — f(x)]. (6.33) 


— œ 


To see that A(a) may be nonzero, we expand the integrand 


r 2 
A(a) = | axl af) +> f"(x) + J 


2 
= aL f(%) — f(-0)] +Z Lf) -f= 0)] +... (6.34) 


where the primes signify differentiation. When the integral f? „f(x)dx 
converges (or at most diverges logarithmically) we have 0 = f (+œ) = 


f'(+œ)= ..., and A(a) vanishes. However, for a linearly divergent integral, 
04 f(+0), 0 =f(+00) =..., and A(a) need not vanish 
A(a) = aL f (co) — f(— o)]. (6.35) 


This corresponds to a ‘surface term’ (‘surface’ in one dimension is the end- 
points). The generalization to a linearly divergent integral in n dimensions is 
straightforward 


A(a) = arcs (r +a) — f (r)] 


ð ð 
= [ede Protege 2r+...| (6.36) 


After applying Gauss’s theorem, all but the first term vanish upon integrating 
over the surface r = R > œ 


A(a) = a = f(R)S,(R) (6.37) 


where S,(R) is the surface area of the hypersphere with radius R. For the case 
of four-dimensional Minkowski space, we have 


A(a) = a’ | dîr 0. f(r) = 2in?a lim R?R,f(R). (6.38) 


Ambiguities in 7,,,, 


The one-loop amplitude 7,,, is (superficially) linearly divergent; hence it is 
not uniquely defined. The expression in (6.28) implies a particular routing of 
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the loop momentum p: the fermion line between the vector and axial vector 
vertices carries momentum p. We could have chosen to route it differently so 
that this fermion line carries p +a, where a is some (arbitrary) linear 
combination of k, and k, 


a=ak,+(a— B)k>. (6.39) 


The fact that integral is linearly divergent implies that T,,, has an ambiguity 
in its definition by an amount 


Anva(@) = Ty ,(@) a T,y,(9) 


1 l l 
x "| oa re ars Vays Praca m Yy (p+a—k,)—m | 


Ln ea a e) 
m” S (p—@)—m E m” @—k)-m" pv 


= AY, + A2), (6.40) 
Applying the result (6.38), we have 


Ô l l 
AM, = (-1) lores yy he tr] iee eo = E TESS | 


—ji2n? a‘ é a 
= ———— lim p’p, tr (2727s pYvYsYu) P" P P?/p 
(27) p> 
_ i2n* as p’p 


Z lim ae 


~ Qn) po Pe 


(6.41) 


After replacing p’p’/p” by g’’/4, we have 
AG) = Epuvad?/8n7. (6.42) 


Since A‘), is related to (6.42) by the exchanges k,  k, and p< v, we have 
from eqns (6.40), (6.42), and (6.39) 


p 
Apuva = AM), + Ai = 822 EsuvalK 1 =~ ka). (6.43) 


Thus the definition of 7,,, has an ambiguity signified by the arbitrary 
parameter p 


Ty ,(@) m Tava) — Be Envap(K1 —k, = Tava(b). (6.44) 


Determination of the anomalous term 


We now attempt to determine fp by imposing the Ward identities. We shall 
see that no value of B exists such that T,,,,(a) satisfies both the vector and 
axial-vector Ward identities (eqns (6.26) and (6.27)). 
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Let us first check the axial Ward identity (6.27). Like those in (6.40) the 
two surface terms in (6.31) can again be evaluated using (6.38) 


i = k3 [e 0 (T m)ysy (p— K, + mml | 
(27)* op. | (p? — m*)[(p — k)? — m*] 


T 


k5 
Ome 2in? lim 53 = (Ya 5 VvV eY pP ki 


—] 


ged EuvopK 1K (6.45) 
and 
Aah (6.46) 
Thus from (6.44) and (6.31), we have 
A es tep 
GT yyi(B) = 2MT (0) — Fa Euvopkik? (6.47) 


For the vector Ward identity (6.26) we have 


l l 1 
KET yya(9) = (—1) abe 4 f Pas Yals (p—qy—m"” poo" | 


l | l 


Using 
= (p — m) — [(p — kı) — m] 
= [(p — k2) —- m] — [E — 4) - m], (6.49) 
we can rewrite (6.48) 
se | 22. l l 
T,y,(9) = ( 1) E tna (p _ q) —m Yv (p pE kK) iy 
6.50 
~ Vals (pk) —m "p—m i (6.50) 


Again the right-hand side is a surface term that can be evaluated using (6.38) 


__* 0 (triya — kz + m)y,(p + m)] 
T ,y,(0) E (2n)* [ap op. (Fe aa) 


[(p — k2)? — m*](p? — m’) 
ki P: 
= On 2in? im pz O57 kap" 


=i 
= 57 ĉrevokík3 (6.51) 


or, with (6.44) 


l 
Tava (b) = ! Sto Evagpk {KS (6.52) 
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Thus it is not possible to put (6.47) into the form of (6.27) and (6.52) into the 
form of (6.26) with any choice of p. As it turns out that there is no anomalous 
term in the Ward identities for <0|/7(VVV)|0» and that there are anomalous 
terms for <0|7(AAA)|0), it is logical to associate the anomaly with the axial- 
vector current. Thus we fix the momentum routing so that the vector Ward 
identity (6.26) is maintained: 1.e., if p = — 1, the axial Ward identity becomes 

Tivi = 2mT,, € 


— 53 ivei KS: (6.53) 
This corresponds to a modification of the axial-vector current divergence eqn 
(6.23) as 


0A (x) = 2imP(x) + (47) e F (x) F yo(X) (6.54) 


where F,,,(x) is the usual electromagnetic field tensor. This extra term, the 
ABJ anomaly, is thus produced by the renormalization effect and has the 
following properties: 


(1) The anomaly is independent of the fermion masses and should also be 
present in the massless theory. 

(2) Adler and Bardeen (1969) showed that the coefficient in the anomaly 
term is not affected by higher-order radiative corrections, i.e., triangle 
diagrams with more than one loop do not contribute to the anomaly term. 
This can be understood heuristically by noting that the superficial degrees of 
divergence of the higher-order triangle diagrams are less than one and the 
momentum-routing ambiguity does not exist for such diagrams. 

(3) As our presentation has been in terms of momentum routing and 
conventional cut-off regularization, the reader may inquire how this anomaly 
problem rears its head in the dimensional regularization scheme. There the 
problem shows up as the difficulty of giving a proper definition to the Dirac 
ys Matrix in space-time dimensions other than four. 

(4) It was pointed out by Fujikawa (1979) that the ABJ anomalous Ward 
identity could be formulated rather directly in the path-integral formalism. 
He showed that the path-integral measure for gauge-invariant fermion 
theory is not invariant under the y, transformation. The extra Jacobian 
factor gives rise to the ABJ anomaly. 


The ABJ anomaly for non-Abelian cases. In non-Abelian theories, Green’s 
functions with odd number of axial vector couplings up to five-point 
functions contribute anomalous terms to the divergence of axial-vector 
current (Bardeen 1969). However the triangle anomaly may be regarded as 
the basic one since it is the simplest and its absence implies the absence of all 
other anomalous diagrams. In the following we shall continue to restrict our 
discussion to the triangle anomaly. Consider 


Too (ki, k2;q) =i [ate dt x LO TV i) VEAS COO) ef 1 Fa, 
(6.55) 
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where 
Vax) = W(x)T*y, (x) 
g(x) = W(x) Ty, sW (x) (6.56) 


and the 7%s are the internal symmetry matrices. Also consider 


Thy (ky, k23 q) = (asatar Vilx1)Vy(%2)P“(0))|0> er =+ 
(6.57) 
with 
P(x) = W(X) Ty s(x). (6.58) 


The anomaly in the axial-vector Ward identity is 


TR = WMT Envogk@k3.D™ + commutator terms (6.59) 


— In? Lvpo 


where 
D*® — 4 tr({ T", T”) T’). (6.60) 


In the non-Abelian situation the Ward identity usually also involves equal- 
time commutators (see eqn (6.65) below for an example). 


n° — 2y 


An important application of the ABJ anomaly in current algebra is in the 
derivation of the soft-pion theorem for the n? > 2y decay. This amplitude is 
defined as 


CV(K 1€1)¥(k 2&2) | °(q)> = i(2)*6*(q — k, —ky)eh(k, )e5(k2) 
x T(k1, k2, 9) (6.61) 
with 


Falki, k2, q9) =e [ate dry e =t IOTI JODI) 
(6.62) 


which has the Lorentz covariant structure 


Dk, k2, q) = iEuvopk ikh r (q’). (6.63) 


vop 


To derive the low-energy theorem, consider the amplitude 


Favalki, k2,q) = [ats d4y e2 7-0 *0/T( A(x) JS y) J5"(0))|0> 
(6.64) 
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which satisfies the Ward identity 
qT walks, ko, q) = —i | d*x d4y eters 


x {KOT AL) IV") J(0)) 10> 

+ OIT — Yo)LAa(x), J") J7"(0))10> 

+ COIT) Aa), JC) I"(y))IO>}. (6.65) 
From the current algebra of eqn (5.80) the commutator terms vanish. 
Naively one would identify the first term on the right-hand side as the 
n? — 2y amplitude via PCAC (5.181) 
—ie?(—q* + m?) EN 
En a d*x dty e277" 


x O/T (07 AX(x) IO) J7"(0))10>. (6.66) 


Then there should be the soft-pion result of F(q? = 0) = 0 (Sutherland 1967; 
Veltman 1967) since the left-hand side of (6.65) vanishes when q* > OasT,,,, 
does not have intermediate states degenerate with the vacuum and coupling 
to the vacuum through the axial-vector current A}(x). However one must 
include the anomaly term in the Ward identity (i.e., PCAC is modified in this 
case) 


(ky, k3, q) = 


fam; iD ; 
qT yyalky ’ kz, q) = e? (m? — q?) Pik, ’ ky 3 q) oe In? Enveph 1KS (6.67) 
where D is the anomaly coefficient (6.60) (see eqn (6.72)). We then obtain the 
low-energy theorem 


ie? D ses 
ami Faki ’ kz, q) z 2n2f, Enva pk 1K? (6.68) 
or 
e*D 
= ' .69 
NO = 5, (6.69) 


Thus in the soft-pion limit the contribution to the n? > 2y amplitude comes 
entirely from the anomaly (Adler 1969). To calculate D, let us first assume 
the simple quark model (without the colour degrees of freedom) where the 
currents are given by 


Ja Œœ) = 4x), Qax) 
23 
AR) = GOs FA) (6.70) 


with 


zi oe =i] | (6.71) 
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The anomaly coefficient (6.60) takes on the value 


i 4 
=54((0.0)2) =; (6.72) 


yielding T'(0) = 0.0123m,' which is about a factor of 3 smaller than the 
experimental value of T(m ~ 0.0375m,‘. This lends support to the idea 
that quarks carry colour degrees of freedom. The anomaly coefficient D is 
proportional to the trace of the fermion loops and there will be an additional 
factor of 3 coming from summing over the three colours 


(0) = 0.037mz?. (6.73) 


We note that in the above calculation the strangeness flavour does not play a 
role as A3 is nonzero only for the first two components. Physically this. 
corresponds to the statement that the pion is composed of nonstrange 
quarks. Clearly (6.73) is not modified when other flavours (c, b,...) are 
included. 


6.3 Renormalization in theories with spontaneous symmetry 
breaking 


In this section we discuss two topics related to the renormalization of theories 
with spontaneous symmetry breaking. First we study the one-loop renor- 
malization of the simplest 4@* theory (5.131) with spontaneous breaking of 
its discrete symmetry. We show how ‘tadpole diagrams’ contribute to a shift 
in the vacuum expectation value (VEV) of the scalar field and that the 
counterterms are the same as those of the symmetric theory. We next study a 
case of spontaneously broken continuous symmetry and show how the 
Goldstone particles remain massless even in the presence of higher-order 
radiative corrections. 


One-loop renormalization and the VEV shift 


We return to the theory considered in eqn (5.137). The original Lagrangian 


(5.131) 
L = (0,6)? += 126? —< 6" (6.74) 
aes 2 4 
has the discrete symmetry ¢ > ¢' = —@. When p° > 0, this symmetry is 
broken by the vacuum with @ developing VEV 
<0|p|0> = v 
v = (p?/A)?. (6.75) 


Perturbing around this vacuum, we define a shifted field 


bi =g—v. 
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In terms of this field we have the potential (5.137) 
À 
V(o') = pb? + Avg”? + 4 oe” (6.76) 


which corresponds to a scalar with mass 2u? with the self-interacting vertices 


shown in Fig. 6.5. 
Xs Koi 


Fic. 6.5. Interaction vertices for the Lagrangian in eqn (6.76). 


We now study the renormalization effects at the one-loop level. 


(1) One-point function (the tadpole diagram in Fig. 6.6). We have 


T 


FIG. 6.6. 
1 { d*k i 
T = (—6i1v) = | — = 
(=O ls k? — 2p? 
= —3iAvI, (6.77) 
where 
dtk i 
(6.78) 


2 | OE k? = 2p? 

To cancel this divergence we need a counterterm — Dd’ in the Lagrangian 
with 

D = —34vl,. (6.79) 


(2) Two-point function (Fig. 6.7). Diagrams (a) and (b) give rise to self- 
energy terms 


d*k i 


2.0) = (—6id) 5 | 


1 f dtk i \ 
E(0) = (—6idv)? > | a (eae = 18112071, (6.81) 


_{ d*k i : 
umila (eae) e 


where 
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We have 
(0) = X,(0) + 2(0) = —3iAT, + 18iA707J,. (6.83) 


This requires a counterterm — A¢ġ'? with 
3 
A = =Z ål, + 9007s, (6.84) 


(3) Three-point function (Fig. 6.8). 


jl f d4k ees 
r(0) = 3(—61Av)( —61A) 5 lone (r) 


— 54i)2v],. (6.85) 


For this we need counterterm — Bd’? with 
B= -5 (54142014) = 922014. (6.86) 


(4) Four-point function (Fig. 6.9). 


x< A A 


FIG. 6.9. 


EN ER: i : 
r4(0) = 3(—6i)? 5 | nyt fa 


= 54271, (6.87) 


which requires the counterterm —4{C@”* with 
Cx -5 (54i221,) = 92214. (6.88) 
Therefore, the one-loop counterterms are 


5V(b’) = Ab? + BO? + £ o'* + Do (6.89) 
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and the effective potential to the one-loop level is given by 
Vi) = VE) + ôV) = (W? + A)? + (Av + Bjo”? 
+ 44+ Opt + Dd’. (6.90) 
Let ôv be the shift in VEV due to the one-loop contribution 


OV’) 
= 0 
Op’ p= or 


or 
2(u* + A) dv + 3(Av + B)(6v)*? +(A + CXL)? +D=0. (6.91) 
Since dv is small, we can neglect higher-order terms so that 


D —D 


Bs ee toe ea 6.92 
Au? +4) 2p? a 
Thus to eliminate the linear term in @’, we define a shifted field 
go” = d' — Ov (6.93) 
<O0|o”|0> =0 to one loop. (6.94) 
In terms of this new field, the potential can be written as 
VICo" + öv) = ag"? + bb"? +5." (6.95) 
where 
> 3120 
b = Av + B + å ôv = Av + 9*vl, ry 
a= H +A+3 dvdv = u? + 3l, + 927071. (6.96) 
From (6.96) we can check that 
b? —ac = 0. (6.97) 


Throughout the above computation we have consistently dropped higher 
powers of ôv. Eqn (6.97) means that we have 


Vio) = V,(¢" + ôv) = ap”? a (ac)*"? 4 ; 4 


la* af, ON Of a\* 
e os 


This means the effective potential still has reflection symmetry in terms of 


p=" + E (6.99) 
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in spite of the addition of counterterms (6.89). In other words the 
counterterms still have the original symmetry as they reflect the ultraviolet 
properties of the theory which are insensitive to soft symmetry breaking. 
Therefore, even for broken symmetry theory, we only need the counterterms 
of the symmetric theory. Further discussion on this point will be given below 
in the study of U(1) symmetric theory. 


Goldstone bosons remain massless in higher orders 


We now take up theories with continuous symmetries where spontaneous 
symmetry breaking is accompanied by the presence of Goldstone bosons. We 
show how the Goldstone particles remain massless even in the presence of 
higher-order radiative corrections. Again we will illustrate the point with the 
simplest example of U(1) symmetry at the one-loop level. 

The U(1) symmetric Lagrangian (eqn (5.138)) 


2 A 
P = 5 (0,0) +5 (On) + (0? +n?) +7(6? +r)? (6.100) 


when expressed in terms of the shifted field 
o =O-vD (6.101) 
with the VEV 
<O|o|0> = v = (u?/A)? (6.102) 
gives rise to the Lagrangian (eqn (5.146)) 


l 
F = 5 [(0,0')? + (0,n)?] — uo? — dvo'(o”? + r?) 


A 
= 4 (o? + T?) (6.103) 


which has a massless field n and massive (27) field o’ interacting through the 
vertices shown in Fig. 6.10. To see how the n-field remains massless at the 
one-loop level we need to check that the mass renormalization counterterm 
ôm vanishes. This must be so since it cannot be absorbed in any redefinition 
of the physical m-mass. We recall that the self-energy diagrams have the 
expansion 


X(p?) = X(0) + 2'(0)p? + Ep?) (6.104) 
y y 
o’ do’ 
o’ o’ o’ o’ T T 
See <a Ke 
T T o’ o’ T T 


Fic. 6.10. Interaction vertices for the Lagrangian of eqn (6.103). 
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om T 
o' Tt o’ 
. © ¥ O -© 
T T N T T T T T T 
(a) (b) (c) (d) (e) 
Fic. 6.11. Self-energy diagrams for the n-particle. 


where 6m? is identified with £(0). The one-loop diagrams shown in Fig. 6.11 
can be calculated using the Feynman rules of Fig. (6.10) 


X,(0) = (—21Av) (— 614 T L = —3A](2u7) 
a Oe "19 | On) k — 2p? j 
1 ( dtk i 
(0) = (—2id) = | —— = = A2? 
l 1 „lf d*k i 
£Ł.(0) = (maa a a) loan = — ÀI (0) 
1 ( d*k i 
2 = (— 614) = |- 5 = 3A 
a(0) = (—61/) 5 | On? K 3A1(0) 
d*k i i 
EO) = (12i) | —— — — = 2f 2u?) — 
(6.105) 
where the subscripts on the Ès indicate the diagram number in Fig. 6.11 and 
d*k 1 
I(m*) = |\— =. .106 
an lose k? — m? ii 


Thus I (2u*) = il, of (6.78). Again we have the usual symmetry factors 1/2. 
Clearly the contributions to &(0) coming from all five diagrams sum up to 
Zero 


(0) = 2,(0) + 20) + 2.(0) + 2,(0) + 2.0) = 0. (6.107) 


This is the promised result: the n-particle remains massless as required by the 
Goldstone theorem 


Soft symmetry breaking and renormalizability 


We note that the Feynman rule of Fig. 6.10 shows that there are five vertices 
which in turn depend only on the two parameters À and v. This means that 
there are three relations among these five couplings (five Green’s functions). 
These Ward identities are consequences of the original U(1) symmetry. The 
counterterms for these vertices must satisfy the same relations in order that 
they can be absorbed into the redefinitions of o and v (Lee 1972a). This can 
be checked explicitly in the one-loop approximation, as was done in the 
earlier example of discrete symmetry. Again this illustrates that the 
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counterterms in the spontaneously broken theory have the same structure as 
in the corresponding symmetric theory. In fact this is the key point which 
explains why spontaneous symmetry breaking ultimately does not spoil the 
renormalizability of the theory. 

We will amplify briefly the important point that the renormalizability of a 
spontaneously broken theory depends only on the renormalizability of the 
symmetric theory. This is in fact a slightly stronger version of a theorem 
(Symanzik 1970a) which states that soft symmetry-breaking terms do not 
destroy the renormalizability of a symmetric theory. By soft symmetry 
breaking we mean asymmetric terms of dimension less than four, 1.e., they 
correspond to vertices with a negative index of divergence (eqn (2.134)) 


6,=d,;-4<0. (6.108) 
From the result of eqn (2.144) for the index of divergence of the counterterm 


Ôa < ) nô; (6.109) 


and from the fact that symmetry-breaking counterterms can arise only from 
diagrams that involve at least one symmetry-breaking interaction, one 
deduces immediately that the index for asymmetric counterterms must be 
negative 


ô < 0. (6.110) 


We can illustrate this theorem with two simple examples in the U(1) 
theory of eqn (6.100). 


(1) Lop = co, which has dimension one or 6s3 = —3. Thus the only 
counterterm which satisfies ô < —3 is Z3 = — Ao (as the n-term can be 
excluded by the reflection symmetry x > — rn). This does not destroy the 
renormalizability of the theory. 

(2) Lsp = e(o? — x7), which has dimension two or ds, = —2. Since the 
only interactions with dimension less than two or 6,, < —2 are (0° + n?) and 
(o? — T°) (as terms linear in the fields can be excluded by reflection 
symmetry), the renormalizability is again maintained. 


Spontaneous symmetry breaking is clearly of the soft variety as shifting the 
fields only changes the terms with dimension less than four. The remarkable 
point is that these breaking terms not only do not induce asymmetric terms 
having dimension equal to or greater than four but the lower-dimensional 
terms maintain the same algebraic relations as the original theory. Thus, the 
process of renormalization does not introduce additional symmetry break- 
ing, in the sense that symmetric counterterms suffice to remove infinities from 
the theory whether or not the symmetry is realized in the ‘conventional’ or 
Goldstone modes. 
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6.4 The effective potential and radiatively induced spontaneous 
symmetry breakdown 


In previous discussions of spontaneous symmetry breaking (SSB) in field 
theory, we used the classical potential part of the Lagrangian to decide at 
each level of perturbation theory which is the true vacuum. We have to shift 
the field at each order (see calculations in the previous section). We need a 
more systematic method for treating SSB which enables us to survey all 
possible vacua at once, and to compute higher-order correction before 
deciding which vacuum the theory finally picks. The formalism is the 
effective potential (Schwinger 1951a,b; Goldstone et al. 1962; Jona-Lasinio 
1964) and the appropriate approximation scheme is the loop expansion 
(Nambu 1968). Here we follow the presentation of Coleman and Weinberg 
(1973). 


The effective potential formalism 


To illustrate this approach in path-integral formalism, we first consider the 
simple case of one scalar field. The generating functional for the Green’s 
function is given by eqn (1.74) 


WUJ] = [ae exp fi | d*x[L(P(x)) + J t: (6.111) 


We can also think of this as the vacuum-to-vacuum transition amplitude in 
the presence of the external source J(x) 
WEJ] = <0|05,. (6.112) 


When we expand In W[J] in a functional Taylor series in J(x), the 
coefficients will be the connected Green’s functions (1.76) 


n 


In WE J] = er [ats o.. dfx, G(x, «2. X,) I(x)... I(x,). (6.113) 


We define the classical field @, as the vacuum expectation value (VEV) of the 
operator @ in the presence of the source J(x) 


dln W _ ee | 
d(x) | <010> Jy 


The effective action of the classical field I'(@,) is defined by the functional 
Legendre transform 


p(x) = (6.114) 


I'(¢,) = In WL J] — | d*xJ(x)b(x). (6.115) 
From this definition, it follows that 


ôro) _ 
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We can also expand I (¢.) in powers of @, 


I'(¢,) = ee [atm dt x PO (x, 2. X,)OAX1) -.. PAX). (6.117) 


It is possible to show that I(x, ...x,) is the sum of all 1PI Feynman 
diagrams with n external lines. Alternatively we can expand the effective 
action I'(@,) in powers of momentum. In position space this expansion takes 
on the form 


I($.) = | dtxE— Vp.) + 2(6,6.)° ZO) + ---]. (6.118) 


The term without derivatives V(@,) is called the effective potential. To express 
V in terms of 1PI Greens functions, we first write [” in momentum space 


d*k d*k, 
T(x, ...x,)= (my Come C Oks + ka) 
Ce A 2 ke), (6.119) 


Putting this into (6.117) and expanding in powers of k;, we get 


i , (d*k,  d*k, 
Mb) =D [atx dt Omi Qn)* 


. [ats eiki eo hn) X giles ry to ka xa) 
x [T™0, ... Od(x,).-- Phx) +..-] 
4 l n n 
= [a D TOO, ... OPT +... (6.120) 


Comparing (6.118) and (6.120) we see that the nth derivative of V(@,) is just 
the sum of all 1PI diagrams with n external lines carrying zero momenta 


l 
Vib) = -} a P”(0, ... OP.) I". (6.121) 


The usual renormalization conditions of the perturbation theory can be 
expressed in terms of functions occurring in eqn (6.118). For example in Ad* 
theory we can define the mass squared as the value of the inverse propagator 
at zero momentum 


TP (0) = —p?. (6.122) 
Then we have 
d? V 
= ; 6.123 
H doł ME, ( ) 


Similarly, if we define the four-point function at zero external momenta to be 
the coupling constant 


r®(0) = —4, (6.124) 
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then 
dt yV 
À = (1.125) 
dot |ø-0 
Similarly, the standard condition for wavefunction renormalization becomes 
Z(G.) le-0 = 1. (6.126) 


Consider now the SSB of a theory with some internal symmetry. SSB 
occurs if the quantum field @ develops a non-zero VEV even when the source 
J(x) vanishes. In this language with eqns (6.114) and (6.116), this means that 
it occurs if 


ôI (Q.) 
~F 0 (6.127) 


for some nonzero value of ¢,. Furthermore, since we are typically only 
interested in cases where VEV is translationally invariant, we can simplify 
this to 


OV(@-) 
re for $40. (6.128) 


The value of ¢,, for which the minimum of V(@,) occurs will be denoted by 
<>, which is the expectation value of @ in the new vacuum. 


Loop expansion 


To calculate V(ġ.) we need an approximation scheme which preserves the 
main advantage of this effective potential formalism, i.e., the capability to 
survey all vacua at once before deciding which is the true ground state. 
Clearly ordinary perturbation theory with its expansion in coupling con- 
stants is not appropriate as we need to, at each order, identify the true 
vacuum state and shift the field. 

Instead, we will here organize perturbation theory in the form of loop 
expansion. This is an expansion according to the increasing number of 
independent loops of connected Feynman diagrams. Thus the lowest-order 
graphs will be the Born diagrams or tree graphs. The next order consists of 
the one-loop diagrams which have one integration over the internal 
momenta, etc. For the effective potential (6.121) each loop level still involves 
an infinite summation corresponding to all possible external lines. The usual 
classical potential we have been working with is simply the first term (the tree 
graphs) of V(@,) in this loop expansion. In fact the loop expansion can be 
identified as an expansion in powers of the Planck’s constant h. This can be 
seen as follows. Let J be the number of internal lines and V the number of 
vertices in a given Feynman diagram. The number of independent loops L 
will be the number of independent internal momenta after the momentum 
conservation at each vertex is taken into account. Since one combination of 
these momentum conservations corresponds to the overall conservation of 
external momenta, the number of independent loops in a given Feynman 
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diagram is given by 
L=I-(V-—l1). (6.129) 


To relate L to the powers of h, we have to keep track of the factor h in the 
standard quantization procedure. First there is one power of A in the 
canonical commutation relation 


[O(x, t), ny, H] = ih d°(x — y). (6.130) 
This will give rise to a factor of h in the free propagator in momentum space 


dtk „ih 
m Komati CDD 


<OIT((x)$(0))0> = | 


The other place where h appears is in the evolution operator exp[ —iHt/h] 
which gives rise to the operator exp[i/h Jf &,,,(@) d*x] in the interaction 
picture. This means that there will be a factor of 1/h for each vertex. Thus, for 
a given Feynman diagram, we have P powers of h with 


PefSV2L21, (6.132) 


Thus the number of loops and the power of h are directly correlated. The 
statement that loop expansion corresponds to an expansion in Planck’s 
constant is really a statement that it is an expansion in some parameter a that 
multiplies the total Lagrange density 


Lo, ô p, a) = a7 L(, 8,4). (6.133) 


The above counting of the h powers (P ) reflects the fact that while every 
vertex carries a factor a~ +, the propagator carries a factor a because it is the 
inverse of the differential operator occurring in the quadratic terms in Z. 
Because hh, or a, is a parameter that multiplies the total Lagrangian, it is 
unaffected by shifts of fields and by the redefinition or division of ¥ into 
free and interacting parts associated with such shifts. In short it allows us to 
compute V(@,) before the shift; thus it is an appropriate perturbation 
scheme for our purpose. 

We should remark that this loop expansion is certainly not a worse 
approximation scheme than the ordinary coupling-constant expansion 
perturbation theory since the loop expansion includes the latter as a subset at 
a given loop level. In fact if we fix the number of the external lines (which we 
do not in the calculation of V(@,)) these two expansions are identical for the 
simple case of one coupling constant. For example, in A¢* theory we have 
(eqn (2.58)) for Green’s functions with E external lines 


4V =E+2I (6.134) 


which can be converted into a relation between the powers of the coupling 
constant (V) and the number of loops by using (6.129) to eliminate Z 


ee) ae ee (6.135) 
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The effective potential of the 1¢* theories 


We now illustrate the calculation of the effective potential in the simple case 
of Ap* theories in the one-loop approximation. The Lagrangian density is 
given by 


with 
= 30,0)" — iuo? 
A 
Lı = Al pt. 


We shall study the three cases corresponding to u? > 0, u? < 0, and p? = 0. 


(1) W? >0 case (no SSB). To calculate the effective potential in eqn 
(6.121), we must sum all one-loop diagrams with an even number of external 
lines having zero momenta (see Fig. 6.12). 


5 0: Š k: 


The 1PI Green’s function is given by 


d*k l ý 
Te? 0, ss 0) = iS, p [iw p g (6.137) 


where S, is the symmetry factor 


(2n)! 
n= n (6.138) 
corresponding to the fact that there are (2n)! ways to distribute 2n particles to 
the external lines of the diagram and that interchanges of any two external 
lines at a given vertex or reflections and rotations of n vertices on the ring do 
not lead to new contributions. The no-loop and one-loop effective potential 
is then given by 


dk e if a T 
One > Qn} k? — p? A] 


dtk Ag2/2 


l À 
Vib.) = 51°82 +5 #8 - 
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The integral is divergent. If it is cut off at some large momentum, we obtain 


l À A? A 
V(b.) = 5 o? + Ji pé + 3972 G at 5 $2) 


l kar Irre 1 


The appearance of the combination u? + (A/2)¢@2 results from the summation 


l pel olea lE ] 
kawr kPa kpo Er Ao) 


(6.141) 


i.e., the 4ġ2/2 term acts effectively as a mass insertion (for further remarks 
see eqn (6.147) below). 

To remove the cut-off dependence we introduce counterterms which have 
the same structure as the original potential 


A B 
Valp) = 5 02 + 5 08. (6.142) 


so that the renormalized effective potential, given by 


VA) oc V(o,) + Vil@.), (6.143) 


is finite and cut-off-independent. The coefficients A and B in (6.142) can be 
determined by the renormalization conditions (6.123)-(6.126). In this way we 
have 


l A ] A u? + Ad2/2 
Vi yg ht 247 2/2] c 
Ap 3 
-5 -7 268 |: (6.144) 


We see that the large @, behaviour of V(@,) is modified by radiative 
corrections. 
(2) x? <0 case (with SSB). For illustrative purposes we choose this time 
to separate the Lagrangian differently 
L = 7(0,¢)" — UP) (6.145) 
where 


l 
Ud) = 50? +5 6 (6.146) 


and take U(¢) as a perturbation. There are two vertices: u? and 447, shown 
in Fig. 6.13(a). Their combination is the second derivative U"(@) which is just 


—e— ieee See ee A C++ OC +e 


(a) (b) 
Fic. 6.13. 
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the scalar mass squared for @ = <>. Thus we use the notation 
m3(d) = K? + 5A’. (6.147) 


The effective potential calculated by summing up the diagrams in Fig. 
6.13(b), with a massless propagator and m2(q) vertices, has the form 


_! , 4 d*k < m; (de) 
V(¢,) > wpe tyi gi + i lone 2 lowe aI 


Ll gas Ag DOK m2(¢,) 
= 5H Q: + 7% +3 {soe Ut aa 


16244 oe PAS yy OD 


ai (6.148) 


We see that, apart from an irrelevant constant, this is the same result as case 
(1) with u? > 0. Thus, if we choose the same renormalization condition as in 
case (1), we arrive at the renormalized potential (6.144). In this case the 
quantity x? has a different physical meaning and is not the mass of the 
particle in the zeroth order. Nevertheless it is still a finite quantity which 
serves to parametrize the theory. 

The calculations in cases (1) and (2) illustrate the advantage of this 
perturbative approach: the same result can be obtained whether there is SSB 
or not; we need not shift the field beforehand. Also, this approach is 
insensitive to how we divide up the Lagrangian, whether as in (6.136) or 
(6.145). More importantly, the calculations illustrate the feature that even in 
the presence of SSB the counterterms are still the same as those of the 
symmetric theory. In other words the ultraviolet divergences respect the 
symmetry of the Lagrangian even if the vacuum does not. This reflects the 
fact that SSB is generated by the nonzero VEV <@> which has the dimensions 
of mass and the ultraviolet divergences are insensitive to finite mass scales. 

(3) x? = 0 case (SSB driven by radiative corrections?). In this case V(@) is 
flat at ġ = 0 and the usual procedure of using the classical potential is 
inadequate to determine whether SSB is induced or not, and we have to go 
to higher orders to see the pattern of symmetry realization. Coleman and 
Weinberg (1973) were the first ones to point out this interesting phenomenon 
of radiatively induced SSB. We cannot take the u? — 0 limit of eqn (6.144) 
because of the infrared singularity. To get around this difficulty, we will 
choose a renormalization condition for the coupling constant different from 
that of (6.125). Instead, at d, = 0, we have 


d+V 
en (6.149) 
dot lo--m 


where M is an arbitrary mass parameter. The effective potential now takes, 


196 Renormalization and symmetry 6.4 


for u « M, the form 


i À 


l u? + Ad2/2 
ue z(e Ez sé) 1 a ee | 


l. 242 2 aaa , Laoag 2u? 
5 AH h 544 Pe + ZA oe In IVE (6.150) 


This has the p? = 0 limit 


l Mod 2 25 
V$.) =7i Pe in Pc -3| (6.151) 


2567 M? 6 


Since the second term in eqn (6.151) is negative for small @,, it has the effect 
of turning the zeroth-order minimum at the origin into a local maximum 
and producing a new minimum at some point away from the origin. In other 
words, the one-loop correction has generated SSB. However this conclusion 
is unwarranted as the new minimum is located at 
g 722 

‘or = oe + O(A). (6.152) 
The fact that A In(<o?>/M7) is bigger than one (i.e., the loop contribution 
is larger than that of the tree) means that the new minimum lies outside 
the validity of the one-loop approximation. In this simple theory with one 
coupling constant such a result is inevitable. Since we want the one-loop con- 
tribution to compete with the three contribution, A In <ġ°> must be large, yet 
A is the only parameter in the theory. This implies that to avoid this 
difficulty we should examine theories with more than one coupling. 


Aln 


Massless scalar QED (dimensional transmutation) 


The Lagrange density is given ” 


l y A 
L = -3 FF + la, - ied JOP -EPP (6153) 
where A, is the photon field with F,,, = 0,A, — 0,A,. @ is a complex scalar 
field 


@ =o, + igo, (6.154) 


where @, » is real. Now we have two coupling constants å and e and it will be 
possible to obtain a small expansion parameter. 

The calculation will be considerably simplified in the Landau gauge, where 
the photon propagator is 


; Guy E k„k,/k? 


o a k? + ig 


(6.155) 
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Because we always work with zero external momenta and because k“ A,,,(k) 
= 0, all graphs of the type shown in Fig. 6.14 make no contribution. Also 
note that the effective potential can depend only on ¢2 = o*¢, = $7. + Ox. 


= 


Fic. 6.14. 


We need only compute diagrams having @, external lines; there are three 
basic classes of graphs with different particles running in the loop (Fig. 6.15). 


Fic. 6.15. 


In this way we obtain 


1, of (AN 1\? 2 9 
V(b.) = 5 44 + £5 4(3) f mn G) l $ 3e} in 7 -? | (6.156) 


The calculation is entirely analogous to the previous case; we only need to 
note that the 73 coupling is 1/3 of the @{ coupling because of the different 
Wick contractions and that the extra factor 3 in the photon loop comes from 
the trace of Landau-gauge propagator. If we assume that å is of order e*, we 
can neglect the 4? term in (6.156). Since M is arbitrary, we take it to be the 
actual location of the new minimum, M = <@). The effective potential 
becomes 


l 3e4 p? 25 
V = — age + — a ener ey A 
We also have the relation 
À llet : 
or 
33 
A= oe. (6.158) 


Surprisingly, the two independent coupling constants are related. This can be 
understood from the fact that we start out with two dimensionless para- 
meters e and A; we end up also with two parameters e and <@>. In other 
words we have traded a dimensionless parameter A for a dimensional 
parameter <@>. This has been called dimensional transmutation (Coleman 
and Weinberg 1973) which is a general feature of the theory without any 
mass scale. Changes in M always involve a new definition of the coupling 
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constant. With (6.158), the potential in (6.157) can be written simply as 


4 2 
V(o.) = a ge n( = 5) (6.159) 


which has a minimum at ġ. = <@> and symmetry is spontaneously broken. 
This time the loop expansion is valid as the one-loop contribution can be 
smaller than that of the tree. Incidently since the mass of the ‘photon’ after 
SSB, m, = e<(@)>, we may use the notation 


m (p) = ed (6.160) 
and write (6.157) as 


V(o.) = (6.161) 


LLE 


64n? <b>? 2 


This is to be contrasted with eqn (6.148) for the scalar loop. 

We should also remark that, if there are fermions in the theory, there will 
be, in addition to the scalar and vector loops, fermion loops. We can 
calculate their contribution in an analogous manner. If we follow the 
procedure of case (2) above and work with massless propagators, we will 
have the general vertex m(ġ) = m + hd, where m is the bare fermion mass 
and h is the Yukawa coupling. Since the trace of an odd number of Dirac 
y-matrices is zero, we must have an even number of fermion propagators. 
Then we can group terms as 


l l l 


Then the calculation becomes exactly the same as the scalar case except the 
important difference of an overall minus sign for the loop integral. And we 
have the fermion loop contribution 


—4 


E mi(¢,)[In 62/M? +...]. (6.163) 
64r 


ôV ($.) = 


The factor of 4 arises from the trace of Dirac matrices. Thus in a theory 
having vector, scalar, and fermion loops, we can combine (6.161), (6.148), 
and (6.163) to obtain the one-loop contribution 


l 
SVO) = ng BMH) + miD) — AmA] In $2/M? +... 


4r? 
(6.164) 


7 The parton model and scaling 


LEPTON—NUCLEON scatterings at high energy and large momentum transfer 
exhibit the remarkable phenomenon known as Bjorken scaling. This 
correlation pattern of the energy and angular distribution of the scattered 
leptons in these deep inelastic processes can be described simply by 
Feynman’s parton model (Feynman 1972). At short distances hadrons may 
be viewed as composed of (almost free) point-like spin 1/2 constituents, the 
partons. It is natural to identify them as quarks. After a description of 
the quark—parton model and some of its applications in the first two sections 
we will then present the formal field theoretical apparatus required to 
describe the short-distance behaviour. This is Wilson’s operator product 
expansion (Wilson 1969) with coefficients satisfying the renormalization 
group equations. Thus the stage is set for a field theory of strong interactions 
with the quarks being identified as the fundamental matter fields. 


7.1 The parton model of deep inelastic lepton—hadron 
scattering 


Kinematics and Bjorken scaling 


The leptons used in deep inelastic processes are either charged leptons 
(electron or muon) or neutrinos which scatter off the target nucleons via the 
electromagnetic or weak interactions, respectively. 
Electron—nucleon case. The momenta for the reaction 
e(k) + N(p) > elk’) + X(p,) (7.1) 


are shown in Fig. 7.1, where X is some hadronic final state with total four- 
momentum p,. 


k’ 
e(k) 


N) X(p,) 
Fic. 7.1. 


We define the kinematic variables by 
q=k—k', v=p-q/M, W° =p, =(p+4q)’ (7.2) 
In the lab-frame we have 


p, = (M,0,0,0), &k, =(E,k), k, = (Ek). (7.3) 
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Then 
v= E-E (7.4) 


is the energy loss of the lepton and, when the lepton mass m, is neglected 
= (k —k')? = —4EE'’ sin? ; <0, Q= =g? (7.5) 
where 0 is the scattering angle of the lepton. The amplitude is given by 
T, = ?ülk', A’)y"u(k, A) a <ni Ja (0)|p, o> (7.6) 


where Jọ” is the hadronic electromagnetic current. The unpolarized differen- 
tial cross-section is given by 


1 1 1 œK k d? 
TA T E nl aoe 


v| 2M 2E (2n)32ky £; (27)°2Pio 
x — = |T,,|7(27)* ôt(p + k — k' — p,). (7.7) 
4 aa 


where p, = X}~, pi. If we sum over all possible hadronic final states (1.e., they 
are not observed) we obtain the inclusive cross-section 


d?o a? (E' 
dQdE’ qf (=) ee 


where a = e?°/4r is the fine structure constant. The leptonic tensor corres- 
ponds to 


r 5 Æp ky.) = Re eee , PE (7.9) 
and the hadronic tensor is given by 
l H d°p; 
Wl? D = gag dd | L EA 
x <p, a|J(O)in><n| JT Op, X27)? 54(p, — p — q) 
: ca BX =p, ol JJP Op, 0). (7.10) 
= i p p, o x p,o . 


Sometimes it is more convenient to rewrite this in the form of a commutator; 
we observe that 


d*x 1q’ x 
S ep, aLI OJP, 0) 


= -jE et POX p, o| J (0) <a J, Op, o> 


= ¥ (22)? 5*(p, — p + Op, alJ,(O)\n><n| J (Op, o). 
(7.11) 
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In the lab frame qọ = v > 0 there is no intermediate state |n> with energy 
E, = M — v < M which can contribute; thus the above term vanishes. We 
can therefore write 


WlPod)= gaz | Gee <p APOO (7.12) 
From current conservation 0"J{" = 0 we have 
q'<p, oF, n> = 0 
or 
Wua = W,, = O. (7.13) 


From (7.13) and the fact that W,,, is a second-rank Lorentz tensor depending 
on the momenta p, and q,, one can deduce its covariant decomposition 


Guy 
W(P, q) = |- W, @ aa a ) 


W, pq pq 
+ M2 (p, = Pa an)(, rs qe dy (7.14) 


where W, 2 are Lorentz-invariant structure functions of the target nucleon 
depending on the invariant variables q? and v of (7.2) and (7.5). We can then 
write (7.8) 


d2 2 0 0 
oak a (2, sin? ae cos? 5) (7.15) 


A measurement of the inclusive cross-section yields information about the 
structure functions W, 5(q’, v) which are the strong-interaction quantities 
characterizing the response (and hence the structure) of the target nucleon to 
electromagnetic probes. 

To get some feeling about the structure functions, we first consider the 
special case where the final hadronic state X(p,) is also a nucleon. The matrix 
element of the electromagnetic current between either the proton or the 
neutron states can be written as 


(NCPE COIN) > = (PLY F 14") 
+ io ,q’F 2(q°)/2M Ju(p) (7.16) 


with q = p — p’. F, 2(q°) are Lorentz-invariant form factors. For the case of 
the proton, F$(0) = 1 and F5(0) = 1.79 (nucleon magnetons) measure the 
total charge and anomalous magnetic moment, respectively, of the proton; 
F’(0) = 0 and F5(0) = —1.91 measure the total charge and anomalous 
magnetic moment, respectively, of the neutron. To check that F)(0) = 1 
does give the total charge of the proton as +1, we note that the charge 
operator 


O|p> = |p> (7.17) 
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implies 

<P'\Qlp> = <p’ |p) = 2E(2n)* 6°(p' — p). (7.18) 
On the other hand from (7.16) we have 


(P'|Olp> = [axir 


_ | d3x e- p'| JE™(O)|p) 


= (27)? 6°(p’ — p)i(p’)you(p)F ; (0) 
= 2E(2n)? 53(p’ — p)F, (0). (7.19) 


Thus (7.17) implies that F,(0) = 1, as promised. It is straightforward to take 
the elastic limit p? = M? in (7.10) and obtain 


2 
we 2 — 6 2 IM ee 2 
1(q°, v) = 0(q* + BETT m(q°) 


W5(q’, v) = 6(q? + 2Mv) 


2M 2 Ge as 
a= gram | Ol = 4M? Gu(q J 


(7.20) 
where 
ZnS 2 q? F 2 
GT) = AG) + a 2(q°) 
Gu(q*) = Filg?) + Flg’) (7.21) 


are the electric and magnetic form factors, respectively. The elastic electron— 
nucleon cross-section is then 


do“ a 
dQ 4E? 
0 0 
cos? 5 il - q’/4M*) EGE — (q7/4M’)Gu] — sin? 5 (q°/2M*)Gi; 
i 0.0 
1 + (2E/M) sin 5 sin 5 


(7.22) 


Thus measurements of the elastic eN differential cross-section yield infor- 
mation about the electric and magnetic form factors. Experimentally Gg and 
Gy, for the proton are given by (for a review see Taylor 1975) 


Gu(q’) = l 
Kp (1 — q?/0.7 GeV’)? 


Glg’) © (7.23) 
where k, = 2.79 is the magnetic moment of the proton. If the proton were a 
point-like (structureless) particle, we would have Gy(q*) = G;(q7) = 1. Thus 
the nontrivial dependence of q? in (7.23) indicates the structure of the proton. 
Also for large q?, the elastic cross-section falls off rapidly as Gg Gy ~ q’*. 
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If the inelastic cross-sections for final states other than the nucleon all 
behaved much like the elastic cross-section, we would expect them to fall off 
rapidly for large q°. The surprise is that experimentally these cross-sections 
for large final-state invariant mass W > M seem to have a much more 
moderate dependence on q (for a review see Panofsky 1968). This leads to 
the idea that there must be some point-like constituents inside the nucleon 
much as the large-angle scattering of the a-particle in Rutherford’s 
experiments suggested that the charge of the target atom was concentrated in 
the ‘point-like’ nucleus. These structureless particles inside the nucleon are 
called partons. A proper description of the parton model will be given after 
we have made a more precise statement of the deep inelastic scattering 
behaviour in terms of Bjorken scaling (Bjorken 1969). 

Define the dimensionless scaling variables 


2 2 
-¢ Q 
= = 24 
2Mv 2Mv 29) 
The range of x 
O0<x<l (7.25) 


is given by the fact that the invariant mass of the unobserved final hadronic 
state is larger than the nucleon mass 


W? =(p+q) =q°+2Mv+ M?>M?. (7.26) 
Note that the elastic scattering corresponds to x = 1. Also define the variable 
v E’ 
ETE eee 
y=5=l-} (7.21) 


which is the fraction of the initial energy transfered to the hadrons. From the 
fact that 0 < E’ < E we obtain the range of y 


O<y<l. (7.28) 


It is convenient to express the cross-section in terms of the x and y variables. 
Using the relation 


dxdy = a Saal (7.29) 
and the definitions 
MW, (g, Y) = Fi(x,q°/M°) 
vW2(q°,v) = Fa(x, q /M°), (7.30) 


we can write (7.15) in the form 


d?o Sma" ; M 
Sees, sare = aa a |: 731 
o uay E F, + (i y= ») | (7.31) 


Bjorken scaling is the statement that in the large Q? limit with x fixed, the F;s 
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are functions of x only. Thus, 


2 
q 
lim F;| x F(x 7.32 
lim (x fz) - 0). (7.32) 
x fixed 
The dimensionless structure functions become independent of any mass 
scale. The F;(x)s are called the scaling functions. Experimentally Bjorken 
scaling seems to be obtained for a rather modest value of Q? > 2(GeV)” in ep 


scattering. 


Neutrino—nucleon scattering. Next we come to the case of the charged 
weak-current process 


vi(k) + N(p) > I (k’) + X(p,)- (7.33) 


Since the basic idea is exactly the same as for the electromagnetic IN 
scattering considered above and since the reaction has also been presented in 
our discussion of the Adler-current-algebra sum rule in §5.2, we merely 
summarize the results. We will assume the current—current interaction for the 
weak effective Lagrangian 


G 
Pie ap J} J* + hic. (7.34) 


where Gp is the Fermi constant. The (charged) weak current J* can be 
separated into the leptonic and hadronic parts 


Nae ee a (7.35) 
The leptonic weak current has the explicit form 
Ji =v." — ysle+ VA — ysu +... (7.36) 
The cross-section for neutrino and antineutrino scatterings can be written as 
d?o™ G? 20 0 (E+E ) G; 
SE a o (v), 27 yw) (v) 
dO dB’ ~ 277 | sin? zri + cos a a M ad 


(7.37) 


dan GFE a 6. 0 (E + E' E 
ay F’ (¥) 2 ` pw) (¥) 
nde ~ 2 2 sin Mea ama, now? 
(7.38) 


where the structure functions W“ are defined as 


v d*x iq:'x 
Wo = | E Ap, oll Jug). ILOP, 0) 


= IN 4 Fup ot WYD.P,/ M? — iW? eapy6P'Q’/ M? 
W2da9p/M* + WO(Padg + Peda)/M? 
+ t a a Ppqa)/M? ° (7.39) 
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The Ws can be obtained from (7.39) using J,, <> J}, (also see §5.2). We also 
define dimensionless structure functions 


MWg’, v) == G(x, q’/M’) 
vWY(q?, v) = GY(x, q?/M’) 


vW3(q?, v) = G3(x, q?/M”*). (7.40) 
These structure functions will also satisfy the Bjorken scaling 
lim G(x, q?/M?) = F(x). (7.41) 
lq2|— œ 


x fixed 


It is often useful to have structure functions with definite helicities. They can 
be obtained as follows. In the lab frame, choose the z-axis such that 


Pu = (M, 0, 0, 0) and du mF (qo, 0, 0, q3). (7.42) 
The longitudinal polarization vector is then of the form 
l 
EP = z (4s, 0, 0, qo) (7.43) 


and the corresponding structure function is 


W5(q’, v) = ep We, 
2 
=—W,- z W, = (1 — v’/q’)W, - W, (7.44) 


where we have suppressed the neutrino superscript (v). The right- and left- 
handed transverse polarization vectors are 


] 

e8 = T (0, 1, i, 0) (7.45) 
] 

eP = RT (0, 1, —i, 0) (7.46) 


and their structure functions are 


l i 
WR=W,t+ aap —q°)W; 


l : 
Wi = W, -zg 0? — PW, (7.47) 


Note that these structure functions with definite helicities, W,, Wp, and Wg, 
have to be positive. In the scaling limit, the following helicity functions are 
functions of x only 


l 
2MWs > Fs =— F, — 2F, 


MW, > F, =F, — 3F;3 
MW, > Fp =F; + $F3. (7.48) 
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The differential cross-section can be written 


d?o™ MEx eae ? 
Gad a 
d?o ME : - ; 

"= G2 “Tl — FY + (1 — yp FO + FL]. (7.50) 


dx dy T 


Note that these equations imply that the total cross-section will grow 
linearly with energy. (This is also the typical behaviour of the neutrino 
scattering off the point-like lepton.) That this is indeed the behaviour 
observed experimentally also suggests that there are point-like constituents in 
the nucleon. 


The parton model 


We shall now calculate the lepton—nucleon structure functions in the parton 
model (Feynman 1972; Bjorken and Paschos 1969) which is the subnucleon 
version of the familiar impulse approximation of high-energy scattering of 
composite particles with weakly bound constituents. The inclusive scattering 
is viewed as due to incoherent elastic scattering from point-like constituents 
of the nucleon, the partons, depicted in Fig. (7.2). The final-state partons 
then recombine (fragment) somehow into hadronic states. Thus we are mak- 
ing the physical assumptions that (1) during the time of current-parton 
interaction one can ignore interactions among partons themselves and (2) 
the final-state interactions (necessary for partons to fragment into hadrons) 
take place on such a relatively long time-scale that they can be ignored 
in the calculation of the inclusive cross-sections. For a more detailed 
presentation the reader is referred to Close (1979). 

Specifically, each of the spin-1/2 partons is hypothesized to carry a fraction 
of the original nucleon momentum ép with 0 < € < 1, i.e., we neglect any 
parton momentum transverse to the nucleon momentum. Then the contri- 
bution to the hadronic tensor (7.10) from such a spin-1/2 parton can be 
immediately worked out as 


l d?p' 
— 4M 2 (2n)>2p5 
x <Ep, o| S(0)|p’, o’><p’, o|J(0)|Ep, X27)? 54(p' — Ep — q) 


K,,(6) 


1 
= M 2, aépyy,u( palp y u(Ep) Ôlpo — €Po — 40)/2p0. (7.51) 


spin 


FIG. 7.2. Inelastic lepton-nucleon scattering as incoherent elastic scattering from partons. 
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The factor of € in the denominator appears because of the change in the 
relative flux from p to ép. The delta function may be written as 


(Po — Epo — Fo)/2Po = 9(Po) SLPo — (EPo + 4o)*] 
= 0(po) d[p’* — (Ep + q)] 
= A(Epo + qo) (2M vE + q’) 


5(E— x) 
2Mv 


= (Po + qo) (7.52) 


For the spin sum, we have 
lo- : 
5 2 WEp)yu(ep + Dulép + ayulép) 
spin 


= $ [pn ÉP +a] 


= 2éLp (Ep + a)y + (Cp + )uPv -p (Ep + Qguy] 
= 4M*&(p,p,/M’) — 2Mveguy +... (7.53) 


where we have neglected the parton mass. The parton tensor (7.51) is then 


] 
K„(&) = ô — (22h -zlet ) (7.54) 


Let f (&) dé be the number of partons with momenta between € and č + dé 
(weighted by the squared charge). Then we can calculate the hadronic tensor 
in terms of an integral over K,,,(¢) 


1 


Ws = [LOKE d¢ 


(0) 


x) PuPy S) 


In this way the delta function that enforces the mass-shell condition of the 

final parton leads to the structure-function dependence on x = —q?/2Mv 
alone, 

MW, > F,(x) =3f (x) (7.56) 

vW, > F(x) = xf (x). (7.57) 


Thus the scaling functions of (7.32) are measures of the momentum 
distribution of the the parton in the target nucleon. 
We also note, from eqns (7.56) and (7.57), that 
2xF (x) = F(x) (7.58) 


which is known as the Callan—Gross relation (1969). This is a direct 
consequence of the assumed spin-1/2 nature of partons. For example if we 
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had used spin-0 partons, we would have had 
Kay © <xplJylxp + 9><xp + q\J,\xp> 
oc (2xp + q),(2xp + q)y. 
With the absence of a g,,, term this would lead to 
F(x) = 0. (7.59) 


There is a simple interpretation of (7.58) and (7.59) in terms of the helicity 
structure functions. If we define photon polarization vectors as in (7.43) and 
(7.46), we have eqn (7.48) 


| 
F; = > F, —2F, (7.60) 
Fr = F + Fer = 2F. (7.61) 
Thus (7.58) and (7.59) can be translated into 
FF; = 0 for a spin-1/2 parton (7.62) 
FF, =0 for a spin-0 parton. (7.63) 


To see that these results follow from angular momentum conservation, we go 
to the Breit frame of reference in which the momentum of the parton just 
reverses its direction without changing its magnitude upon collision with the 
virtual photon 


qu = (0, 0, 0, —2xp), 


xp, = (xp, 0, 0, xp), 


If the parton has spin-0, only the virtual photon with zero helicity (e) can 
contribute while the helicity +1 states (et, €?) do not conserve angular 
momentum along the direction of motion. On the other hand, for the spin- 
1/2 parton (with negligible mass) the spin component also gets reversed upon 
collision and this will require the virtual photon to be in the +1 helicity state; 
hence F, = 0. Experimentally (7.58) or (7.62) is reasonably satisfied in the 
scaling region and we can conclude that nucleons do indeed have charged 
spin-1/2 point-like constituents. 


7.2 Sum rules and applications of the quark—parton model 


It is natural to identify these charged spin-1/2 partons with the quarks which 
were first invented to account for the spectroscopic properties of hadrons. 
Eventually we shall develop a field theory of the strong interaction, quantum 
chromodynamics (QCD), which is a non-Abelian generalization of the 
familiar theory of quantum electrodynamics (QED). In QCD the quarks, like 
the electrons in QED, are the basic matter fields with interactions mediated 
by the electrically neutral vector fields, the gluons, much as the photon 
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mediates the electromagnetic interactions among electrons. With this picture 
we can also have a qualitative understanding of the form of the parton 
distribution function f(x). Experimentally it has the shape shown in Fig. 
(7.3). To understand this we follow the presentation by Close (1979) and start 
with a primitive model of three free quarks of the nucleon (see Fig. 7.4) for 
which the parton distribution function is essentially a delta function at 
x = 1/3, i.e., f(x) ~ d(x — 1/3). We turn on the interaction of the quarks 
with gluons; this distribution will be smeared by the gluon exchange be- 
tween quarks (Fig. 7.5). Then, just as the case of QED where the virtual 
photon (emitted with a bremsstrahlung momentum spectrum of dk/k) can 
create e*e pairs, the gluons (emitted with a probability ~dx/x) can pro- 
duce qq pairs. These processes of internal conversion and bremsstrahlung 
will produce a ‘qq sea’ at small x to give the final distribution (Fig. 7.6). 
We would like to see whether the high-energy lepton—nucleon scattering 
data are consistent with the quark quantum number fixed by the spectro- 
scopic phenomenology. Working with the quark model with only light 


I(x) 


l x 
Fic. 7.3. 
l 
FIG. 7.4. 
l 
FIG. 7.5. 


Fic. 7.6. 
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quarks u, d, and s, the hadronic electromagnetic current has the explicit 
form 
J, = 


y,u — 4dy,d — 38y,s (7.65) 


2 Ss) 
3 
and, below the charm threshold, the hadronic (charged) weak current has the 
form 
Ji = uy*(1 — y;)(d cos 0. + s sin 6,) (7.66) 


We shall work in the approximation of vanishing Cabibbo angle (0, ~ 0); 
hence 
Ji x uy" — ys) d. (7.67) 


Here we have used the quark labels as particle field operators. In the 
following we shall use these names to denote the quark—parton distribution 
function instead. With the squared electric charges factored out explicitly, we 
have, through eqns (7.56) and (7.61), 


FP = f(x) = $u, + ū,) + o(d, + dp) + a(S, + §,) (7.68) 
FP (x) = f(x) = (Un + Oy) + (dn + dn) + 5(Sn + 5n) (7.69) 


where qy(x) denotes the probability of finding a parton with longitudinal 
momentum fraction x carrying the quantum number of quark q in the target 
nucleon N. They are constrained by the quantum number of nucleon. For 
example, 


Isospin: . 
; | (u0) = B0] = Eds) = 4,001} dx = 5 (7.70) 
Strangeness: . 
| [s,(x) — 3,(x)] dx = 0 (7.71) 
Charge: 


1 


1 
[ Fae — ajo) dx |5 a = 60) ax 
(0) (0) 


1 


— | : [s,(x) — 8,(x)] dx = 1. 
0 


Using isospin symmetry (i.e., the invariance under the interchanges pn 
and u d), we have 


u(x) = d(x) = u(x) 
d(x) = u(x) = d(x) 
S(X) = Sy(x) = s(x). (7.72) 
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Thus (7.68) and (7.69) may be written 
FP(x) = f(x) = $u + Ñ) + (d + d) + os + 5) (7.73) 
FẸ (x) = f(x) = $d + d) + Gu + ü) + $(s + 5) (7.74) 
The ratio of the proton and neutron structure function is then 
FF(x) 4u +ū)+ (d+ d)+ (s +5) 
F(x) (u+t)+ 4d + d)+ (s +85) 
Since all q(x)s are positive, we must have the bounds (Nachtmann 1972) 
1 Fr(x) 
-< <4 . 
4E RO S e 
which is in fact consistent with experimental data. 
Furthermore, our discussion of the quark-parton model at the beginning 
of this section also suggests (e.g., Kuti and Weisskopf 1971; Landshoff and 


Polkinghorne 1971) that the quark distribution function can be usefully 
decomposed into valence quarks and sea quarks 


q(x) = q(x) + q(x). (7.77) 


The presence of the valence quarks is already indicated by the original quark 
model. Thus protons and neutrons have valence quarks of (uud) and (udd), 
respectively. The sea quarks correspond to those quark pairs produced by the 
gluons: they are symmetric with respect to the flavour SU(3) group and, as 
indicated by our discussion above, they should be concentrated in the small-x 
region. For the proton target we have 


u(x) = 2d,(x) 
s(x) = U(x) = d(x) = 5x) = 0 
u(x) = ū(x) = d(x) = d(x) = s(x) = §,(x) = G(x). (7.78) 
Thus eqns (7.73) and (7.74) may be written as 
FP(x) = 7u(x) + G(x) 


(7.75) 


FP (x) = 3u(x) + 3G(x). (7.79) 
Their difference directly measures the valence quark distribution 

FP(x) — Fr(x) = guy) (7.80) 
which should have a peak approximately around x = 1/3, as suggested by 


Fig. 7.5. 


Also, the experimental observations 


l asx—-0 
p n 
FRG)/FR@) +14 Saa (781) 
supports the expectation that G(x) is important only in the x — 0 region, and 
the feature that the valence distributions dominate in the x — 1 region with 
w(x) > d (x). 
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For the neutrino—nucleon structure functions, corresponding to eqns 
(7.73) and (7.74), we will merely list the basic results 


FP(x) = 2d(x) FP) = 2u(x) 

FR(x) = 20(x) FR(x) = 2d(x) 

F(x) = 0 F(x) = 0 

F(x) = 2u(x)  FE(x) = 2d(x) 

FP) = 2d(x) FR (x) = 20(x) 

F(x) = 0 F2(x) = 0. (7.82) 


The functions on the right-hand sides are distribution functions for proton 
targets and the factors of 2 reflect the presence of both vector and axial- 
vector parts in the weak currents. We can then isolate the strange-quark 
distribution as follows. Using F, = x(F, + Fp + Fs) and (7.82), we have 


FY(x) + FY(x) = 2x(u +ü +d + d). (7.83) 
From F, = xF; and eqns (7.73) and (7.74), we have 
FP(x) + Fo(x) = x[5(u + ū + d + d)/9 + 2(s + 8)/9]. (7.84) 
They imply that 


F?(x) + F(x) — > [FP(x) + FY] = z [s(x) + 5(x)]. (7.85) 


The experimental data are consistent with a vanishing right-hand side, except 
for the small x (<0.2) region. In other words the strange quark and 
antiquark content of the nucleon is very small. Furthermore, if we assume 
that the sea-quark distributions are SU(3) symmetric, the ū(x) and d(x) 
contents should also be small. 


We next consider a number of sum rules; their validity strongly supports 
the quark—parton picture we have presented. 


The Adler sum rule 


This sum rule has already been derived in our discussion of current algebra. 
Eqn (5.109) takes on the following (8, =0) form in the scaling limit 
—q*? > œ, v > 0, with x fixed 


i dx 
| =, LEF) — FP@)] = 2. (7.86) 
0 
We can obtain the same result directly from the quark—parton model. Since 
F, = x(F, + Fr + Fs), (7.87) 


the combination of structure functions appearing on the left-hand side of 
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(7.86) can be expressed in terms of the quark distribution function through 
(7.82) 


F P(x) — FP(x) = 2x{[u(x) — a(x)] — [d(x) — d(x)]} (7.88) 
= 4xT;(x) 


where T, is the third component of the isospin density. Using the fact that 
proton has isospin-1/2 (eqn (7.70)) we immediately recover (7.86) 


|2 [FY(x) — FP(x)] = 4 | T3(x) dx = 2. (7.89) 
0 0 


Gross—Llewellyn Smith sum rule 


The sum of the scaling functions F}, = Fp — F, 


FY(x) + F(x) = —2[u(x) + d(x) — a(x) — d(x)] (7.90) 
can be written as a combination of baryon number and strangeness densities 
F(x) + F(x) = —6[B(x) + 4S0] (7.91) 


with 
B(x) = 3[u(x) + d(x) + s(x) — a(x) — d(x) —5(x)] (7.92) 
S(x) = —[s(x) — S(x)]. (7.93) 


Since the proton has baryon number 1 and zero strangeness, we obtain the 
Gross—Llewellyn Smith (1969) sum rule 


1 


| dx[ F(x) + FX(x)] = —6. (7.94) 


0 


The momentum sum rule 


If the quarks were to carry all the momentum of the target nucleon, we would 
have the sum rule 


1 


| [u(x) + d(x) + s(x) + G(x) + d(x) + 8(x)]x dx = 1. (7.95) 


0 
Since the x ~ 0 region is not important to this integral, we can drop all the 
sea-quark contributions 


1 
| [u(x) + d(x)] xdx = 1. (7.96) 
(0) 
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The quark density functions on the left-hand side can be expressed directly in 
terms of the measurable structure functions (7.84) and we obtain 


1 


| CFP (x) + F%(x)] dx = $. (7.97) 
(0) 


Similarly, 
1 
| [FP(x) + F3(x)] dx = 2. (7.98) 
(0) 


Experimentally, however, we find the integrals in (7.97) to be approximately 
0.28. This indicates that almost 50 per cent of the nucleon momentum is 
carried by some constituents which do not interact with the electromagnetic 
or the weak currents (Llewellyn Smith 1974). This again is in accord with the 
expectations of the QCD-parton model where one identifies these neutral 
constituents with the gluons. 


Other applications of the quark-parton model 


For the remaining part of this section we shall briefly touch upon two 
other topics of the quark—parton model—its applications in the description 
of high energy e*e~ annihilations and the Drell-Yan process of lepton pair 
production in hadron-hadron collisions. We will follow the presentation of 
(Close 1979) and (Aitchson and Hey 1982). 


ete annihilation 


(1)e*e” + pp. We shall use e*e~ annihilation through the one-photon 
intermediate state into a +u” pair (Fig. 7.7(a)) as the ‘reference reaction’ in 


et ut et p 
PAA ata 
e- H e p 
(a) (b) 

FIG. 7.7. 


describing annihilations into other final states. The total cross section for 
ete — pu*p” may be calculated in QED as 


Ana? 4m?2\1/2 
3q (i — ) (2m? + q’) (7.99) 


a(e"e > PB) = 


where q is the intermediate photon momentum 


q? = (p+ +p-¥} =s20. (7.100) 
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For high energies s > mž, we obtain 


Ana? 
35 


ao(etTe > pp) = (7.101) 


1 


The fact that the cross-section falls as s~* is typical for e*e~ annihilation 


into point-like particles. 


(2)e*e” > pp. The amplitude for this process (Fig. 7.7(b)) is related to that 
of ep elastic scattering by crossing symmetry. We calculate the cross-section 
to be 


4na? (, 4M? 
o(e*e” > pp) = < (1 - ‘ 


1/2 
oe eer [2M2G Aq?) + PGU 
(7.102) 


where G;(qg*) and Gy(g?) are the electric and magnetic form factors, 
respectively (see (7.21) with q? > 0. For large gq’, Gq?) x Gy(q?) ~ q+. 
Thus for high energies s > M2, the cross-section a(e*e” —> pp) ~ s~ ° falls 
off rapidly as is typical for annihilations into any given final state of hadrons 
with structure. 


(3) e*e” — hadrons. We now consider the inclusive process of e*e~ 
annihilation into all possible hadronic final states. In the quark—parton 
model we expect this to take place via e*e” — q,q; and the quarks then 
fragment into free hadrons (Fig. 7.8(a)). The subscript i ranges over all 
possible (flavour and colour) labels of the produced quarks. Thus 


o(e*e” > hadrons) = ¥ a(e*e” > q,q)). (7.103) 
/ 
0 
et —— > e 
Vi 


(a) (b) 
Fic. 7.8. 


The ratio to the reference reaction e*e” — pp” cross-section is then 


a(e*e — hadrons) 


R= ye? (7.104) 


(ete utp) 
so the ratio R measures the sum of squared quark charges (Cabibbo, Parisi, 


and Testa 1970). Thus for energies below the charm threshold we sum over 
three colours of the u, d, and s quarks 


R=3$+4$+3)=2 for /s < 2m, (7.105) 
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and above charm and bottom thresholds 
R=24+36+3 = for /s>2m,. (7.106) 


The data seems to support this scaling behaviour with three colours. (See 
Fig. 7.9 taken from a recent review by Felst (1981).) 
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FIG. 7.9. Compilation of R-values from different e*e~ experiments. Only statistical errors are 
shown. The quark—parton model prediction (eqn (7.106)) is also indicated. 


One should also mention that e*e” annihilations have many other 
detailed features that check well with the quark—parton model description. 
For example, one finds Bjorken scaling in the differential cross-section for 
the inclusive reaction e*e” — hX (i.e., one of the final hadrons is detected) 
which is related by crossing to the inelastic ep — eX scattering. Also, there is 
strong experimental evidence that the e*e~ annihilation final-state hadrons 
form jets, i.e., they tend to flow in preferred cones of small width. For 4 GeV 
< ./s < 7.5 GeV one finds two-jet events (Hanson et al. 1975; Fig. 7.8(b)) 
with the jet axes having the angular distribution ~(1 + cos? 0), where 0 is the 
polar angle of the jet axis with respect to the e*e” beam. Such a distribution 
is characteristic of ane*e™ final state of spin-1/2 point-like particles. This is 
clearly in agreement with the expectation of the quark—parton model with its 
implicit assumption of transverse momentum cut-off. Finally at even higher 
energies (> 7.5 GeV) corresponding to the gluon bremsstrahlung as expected 
in the QCD-parton model, one begins to find evidence for three-jet events 
(Brandelik et al. 1979; Barber et al. 1979; Berger et al. 1979). 


The Drell-Yan process 


As we shall see in the next section, it is possible to provide a more formal 
basis for parton-model descriptions of the deep inelastic IN scattering and 
high-energy e*e” annihilation in terms of the light-cone and short-distance 
operator product expansions. However this cannot be done for other high- 
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energy and large-momentum transfer processes. The parton formulation has 
the advantage of suggesting parton descriptions for which direct formal 
operator argument may not be possible. The most important example is the 
Drell-Yan process (1971) (Fig. 7.10(a)) 


pp > utu “X (7.107) 


(a) 
Fic. 7.10. 


where app pair is produced in hadron—hadron (usually proton-proton) 
collisions along with the unobserved hadron state X. The parton model leads 
us to expect that in the limit of large s = (p, + p) —> œ and large virtual 
photon mass q? — œ, with the ratio q°/s fixed, the reaction can be assumed 
to proceed via the annihilation of a parton and antiparton, each coming from 
one of the initial hadrons, into a massive virtual photon which then decays 
into the observed p*p pair (Fig. 7.10(b)). In the centre-of-mass system, 
neglecting all masses, we have 


pi = (p. 0, 0, p), p} = (p, 0, 0, —p) 
and 
s = Ap? (7.108) 


Also neglecting the parton masses and transverse momenta, the parton 
momenta have the form 


ki = xpi, kh = xph (7.109) 
leading to the photon momentum 
gh = ((x; + x2)p, 0, 0, (x, — x2)p). (7.110) 
Thus we have 
q? = 4x1 Xp? = X1 X38. (7.111) 


The probabilities of a quark and antiquark pair of the ith type with 
momentum fractions x, and x, in the initial protons are given by 


Gi(X 1) dx1G;(X2) dx2 + q:(x1) dx,q;(x2) dx. (7.112) 


This is to be multiplied by the cross-section for the basic parton process of 
qdi > yw of 


do _ _. 4na? 
dq? A: > HTH”) = 3q? e? Ò(q? — xX1xX25) (7.113) 
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to obtain the cross-section for the Drell-Yan process 


1 
2 


do : Ana = 
ag? (pp > uw X) = EJ » e? | Lax 1)Gi(x2) 
0 


+ aaaea SES a -3) (7.114) 


X4X2 X1X2 


Thus the parton model will have g*(da/dq?) to scale as a function of g’/s. The 
experimental support for this prediction is quite convincing. Note that the 
same quark distribution functions are measured in the deep inelastic lepton 
proton scatterings, so one can also make a prediction of the absolute 
magnitude. 


7.3 Free-field light-cone singularities and Bjorken scaling 


In this section we study Bjorken scaling in the framework of field theory, thus 
giving some of the parton model results a more formal foundation. For a 
general introduction see, for example, Gross (1976) and Ellis (1977). 


The deep inelastic limit and the light cone 


First we will demonstrate that the deep inelastic IN processes of §7.1, with 
—qg’, v> œ, and —q*/2Mv fixed, probe the light-cone behaviour of the 
current commutator. We recall (7.12) that the hadronic tensor in the 
differential cross-section can be expressed as a current commutator 


l dix 
W(P: 9) = a3 ie <p, ol Jy(x), JO)IIp,o>. (7.115) 


The scalar product in the exponential may be written 


co Wows Gs) (07 A) 5, oe Wa) ro 8) aa ys (7.116) 


v J2 R J2 


where qr =(41,92) and x;=(x,,x,). In the rest frame of the 
target nucleon, the momenta are given by 


Py =(M,0,0,0,), q, = (v, 0, 0, (v? — q°)Ż). (7.117) 
In the deep inelastic limit (— q*, v > œ with —q?/2Mv fixed) we observe that 
dotq3~2v and qo— q3 ~ q?/2v. (7.118) 


We expect that the dominant contribution to the integral (7.115) comes from 
regions with less rapid oscillations, i.e., g°x = O(1); hence 


Xo —X3~ O(1/v) and x + x3 ~ O(1/xM) (7.119) 
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or 


i 
x2 — x2 (=) (7.120) 


Thus x? = xê — x3 — x2 < x2 — x} ~ O(1/—q?°) which vanishes as 
—q* — œ. In other words, in the scaling limit we are probing the structure of 
the current product near the light cone. This reduces the study of Bjorken 
scaling in field theory to the study of the light-cone behaviour of the current 
product. 


Free-field light-cone singularities 


As it turns out, Bjorken scaling corresponds to the statement that the current 
commutator has the light-cone behaviour of a free-field theory (for a review 
see Frishman 1974). To pave the way, we shall first study the free-field light- 
cone behaviour of some simpler products. 


(1) Products of fields. In free field theories, the products of fields such as 
commutators and propagators are singular on the light cone (x? ~ 0) and the 
leading singularities are independent of the masses. Consider, for example, 
the propagator of the scalar field given by 

COITO) = iaro =i | SKS 
OES" J Qn) k? — m? + ie 
The Fourier transform in (7.121) can be calculated to give (see, for example, 
Bogoliubov and Shirkov 1959) 


(7.121) 


r(x) = F(x?) + gp PODO V2) — IN (n V2?) 


gr oat AK-A) 012) 


where J,, N,,, and K, are Bessel functions. For x? ~ 0, we have 


—] i 1 im, myJyxê m? 
A = g eO + Fag — ga ln — 


_ 2 
Bn 2 T 


l 1 


ese 252). 
A Ge) a) + O(m*x*) (7.123) 


The leading singularity is independent of the masses as the x? ~ 0 region 
corresponds to the large k? in the momentum space. Thus we can also 
calculate this mass-independent singularity directly from a simpler object, 
the massless propagator 


dtk eTit 
Arx) = lone k? + ie 
dèk ae dko eT Koo 
(27)t ki —k? + ie 


-x 


(7.124) 
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The ko-integration can be performed by the standard contour method 


dko eT Koro = 17 ; : 
= o —ilk|xo 0(— ilk|xo i 7.125 
We then have 
l , , 
AG) = -o | d(e" — e-*) (x5) e" + O( — Xp) e] (7.126) 


0 


where x = |k| and r = |x|. Using the identity 


e dt = | ett" dr = - , (7.127) 
(a + 18) 
0 


we obtain 


—] l l 
E EE T E ee 
P(x) zl (xo) (. — Xo + ie a + xo — z) 


l i 
o(— A LE E S oD eee 
oe «(+ ) 


Al Axo) 4 8(— xo) l 


~ An? | r? — x2 + iexy r? — x2 — iexo 
l l 
eee S 7.128 
An? x? — ie ( ) 


which agrees with (7.123). One can do a similar calculation of the leading 
singularity for the commutator of two scalar fields 


[ d(x), ¢(0)] = iA(x) = ais | dtk e~™ *e(ky) 6(k? — m?) (7.129) 
with the result 
A(x) = = elxo) ô(x) for x? zO. (7.130) 


Thus, we have the singular-function identity 
i | dtk e™™ “efko) lk?) = (27) elxo) 6(x?). (7.131) 


The result (7.130) can be viewed in another way: the light-cone singularity of 
the commutator A(x) and that of the propagator function A,(x) are directly 
related 


A(x) = 2e(xo) Im(iA,(x)). (7.132) 
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This reflects the singular-function identity 
l l 
—— — — = —2rielxo) (x? 7.133 
—x?+ig —x? —ie oye) or 


which is a special case of the general aed 


1 \ 1 y 
(<a) -(—=;)-< (n= eye ek Aa 


In the following calculations we shall obtain the commutator singularities 
through those of the propagators by making the replacement 


(—x? — ie)" > 2rie(xo) 57 (x?) /(n — 1)! (7.135) 
For the fermions, the results are summarized as 
(Val), PeO) = iSu — Y), 
Sap(x) = (iy -ô + m), A(X), 
COITADO = iSl — y), 
SE) = (iy -ô + m)ag Ag(x). (7.136) 


For x? x 0, we have 


l 
Sap X) © Gy’ Dag È (Xo) 50) | (7.137) 


Sagl) ~ (iy: Daal z (7.138) 


l 
4r? (x? — ie) 
(2) Product of scalar currents. We can extend this analysis to the case of 
composite operators. Consider for example the scalar current 


I(x) = 16(x):. (7.139) 


Note that the effect of the normal ordering is to remove the singularities 
which occur in the product ġ(x + d(x — ¢) as ¢* > 0. The singularities in 
the product T(J(x)J(0)) can be worked out by using Wick’s theorem 


T(J(x)J(0)) = T(:67(x)::67(0):) 
= 2[<0|T(o(x)d(0))|0>]? 
+ 4¢0|T( G(x) G(0))]0>: (x) P(0): 
+ :6°(x)*(0): 
= —2[A;-(x, m)]? + 4iA-(x, m): 6(x)G(0): 
+ :?(x)62(0):. (7.140) 
Hence for x? = 0, we get 


TOUGH Sees en OO: 


Brt(x? — is)? -27(x* — ie) 


+ :6?(x)6?(0):. (7.141) 
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If we take (7.141) between two arbitrary states |A> and |B), 


A|B A|: 0): IB 
aaoo a ea Ae 


82*(x? — ie)? 1*(x* — ig) 
+ <A|:67(x)o7(0):|B> (7.142) 


which corresponds to the diagrams in Fig. 7.11. 


J (x) J (0) 
x 0 
Vales GD avara x 0 x 0 
— + + 

A B 

A B 
A B A B 
(b) (c) 


(a) 


Fic. 7.11. Diagrams that are singular on the light cone. Free massless propagators are 
represented by light straight lines: (a) has two; (b) has one; (c) has none. Note that only (a) will 
contribute if |4 = |B> = |0>. 


To calculate the singularities of the commutator [ /(x), J(0)] we only need 
use the identity given in (7.135). 


Free-field singularities and scaling 


Now we are ready to demonstrate Bjorken scaling in free-field theory. 
Consider the electromagnetic current given by 


F(x) = PONY): (7.143) 


where Q is the charge operator. Following the same procedure used in the 
above case of scalar current densities, instead of the commutator 


LJ,(x), J,(0)], we will first calculate the time-ordered product by using 
Wick’s theorem 


T(J, (x) Jy(0)) = TCP, QW): (0)y,QW(0):) 
= tr[iS-(—~x)y,iS-x)y,Q7] 
+ :W(x)7,QiS-(x)y,QW(0): 
+ :W(0)y,QiS—(—x)y,QW(x): 
+ :W(x)y,QW(x)W(0)y,QW(0): (7.144) 
where Sp(x) was defined in (7.136). Using (7.138) and the identity 


Vulva = (Suvap + lEnyap)s)Y" (7.145) 
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where Savio = GuvGap + GupGva — GusJvp» We can write (7.144) in the limit 
x? ~ 0 as 


T(J,(x)J,(0)) © (tr 9°) O Juv = 2X4) 


n*(x? — is)* 
į PE ET £5. aval V8(x, 0) — V4(0, x)] 
+ i&yayglA"(x, 0) — A°(0, x)]} 
+ :W(x)y,Qw (x) (0)y,Qy (0): (7.146) 
where 
V(x, y) = P Wy): (7.147) 
A*(x, y) = POY YO): (7.148) 
If we write 
E 2 Gus l | 
(x? iat 3i 12 On % (x? — ie)? an 
and 
x -1 l 
ope (z = z) (7.150) 


and use the substitution (7.135), we obtain the leading light-cone singularities 
of the current commutator 


it 


LIO, OJ & EE Baggy APE) + § 2 DOE) 


a {SpavpL V(x, 0) E V*(0, x)] 


T i€ avg [4 (x, 0) = A*(0, x)]} 0" a o 


+ :Wx)y,QWoyW(0)y,Qy(0):. (7.151) 


We can then translate this explicit form of the current commutator into 
statements on the cross-sections for e*e~ annihilation and for inelastic IN 
scatterings. 


(1) e*e” annihilation. Following the procedure of §7.1 it is straight- 
forward to show that the total hadronic cross-section for e*e~ annihilation 
can be written as a current commutator 


n a? 


o(e*e — hadrons) = ead 
q 


[ats e" *<O|LJ,(x), J*#(0)]|0>. 
(7.152) 


The most singular light-cone term comes from the first one on the right-hand 


224 The parton model and scaling 7.3 


side of (7.151). (Thus this actually probes the short distance behaviour of the 

current commutator.) 

82707i tr Q? 
3n°(q")? 

+ 4 07[6'(x*)ée(xo)]}. (7.153) 


| d+x el *{8 5"(x2)e(x) 


o(e*e — hadrons) % 


Using the identity in (7.131), we get 


E 82707 tr Q? (8 g 
o(e*e7 — hadrons) ~ ~ 3n(q2)2 (5 7 ae 8(q)e(4o) 
2 
= A  Q? (7.154) 
3q 
or 

Scot 

_ o(e-e — hadrons) wO (7.155) 


© olete > putu”) 


This justifies the results of the parton model (7.104) if the leading short 
distance singularity is that of the free-field theory. We next consider the 
genuine light-cone process of deep inelastic lepton—hadron scattering. 

(2) Lepton—hadron scattering. For deep inelastic IN scattering (7.115), the 
first term on the right-hand side of (7.151) will not contribute since it is a c- 
number; thus the nontrivial leading singular term will be the second one 


POA A] 
venre 
re] 


(7.156) 


We can expand the bilocal operator 


{x x £ = x le © x" x”? 
kea 


" Y(O) By, Ona- -Ôn PO) 
(7.157) 
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x x | x”! x”? xn 
a aoe = Dek Seas ~ fo(n+1) 
G) sf 3] m 2 n! 2 2°02 Obuina... u0) 
a| O(X7)E(Xo) 
X Syavp Ô ee 


Lt Re AEN a 
PD -y y y O Burna- -ua0) 


even n 2 


to get 


ee] (7.158) 


X lEpavp l J 
where 
r o0 ee <> 
OS u...un (0) = Y0) ôn, Oaks ô yQ? y (0) (7.159a) 
inti ra T TORA) 

O'n... u (0) = WO) ĉn, Ou, - - -nY Y5Q Y (0). (7.159b) 
To calculate the structure functions, we write 
l 
5 Y <pal Ov D.,,(O)po> = A"* Yp*p,, Pu- Pu, + trace terms (7.160) 


where A“"* +) some constant and where the trace terms, which contain one 
or more factors of g,,,,, will produce powers of x* when contracted with 
x#1x#2... x" in (7.158) and are less important near the light cone x? ~ 0. Also 
the 0’"*») term will not contribute to the spin-averaged structure functions 
due to the antisymmetry property of ¢,,,,. We then have for (7.115) 


Lb. PO 2 ep pe 
W ~~ —— weg Aaty 
mhp» 9) rake j z 2 ) n! 


2 
x Snavp | Pee tro) | (7.161) 
27 
Define 
oo , n Aint) a 
odd n . 
then 


` rqi 
W, (p.q) © TT | = et [ae ee) 


X Supla + E ERD, (7.163) 
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Using the identity (7.131) 


da 
i | a e™ GP (x?)el(xo) = Êl + Ep eldo + Epo), (7.164) 


we have 
l 
Way © VW [acs (€) 6(q* + 2Mvé) 
x (GuaJ vp + IG up va 7 GuvIap (4 T čp)"p’ 


l 
a | df (E) 6(E + G?/2Mv)(—Mvg,y + 2Ep,p, +...) 


2M?y 
Juv , X PuPv 
= — — ve 7.165 
Foo oop or + l (7.165) 
for x = —q°/2Mv. Thus we recover the parton-model results of eqns (7.56) 


and (7.57) 
MW, > F\(x) = 2f(x) 
vW > F(x) = xf (x). (7.166) 


This implies that the assumption of canonical free-field light-cone structure is 
equivalent to that of the parton model. 


PART Il 


8 Gauge symmetries 


THE symmetries we have discussed up to this point are global symmetries. The 
parameters ¿° of the symmetry transformation in eqn (5.10) are independent 
of space-time; thus fields at different space-time points are all supposed to 
transform by the same amount. We now consider theories where the 
symmetry transformations are space-time dependent, 1.e., €f = e*(x). They 
are called local symmetries or gauge symmetries (Weyl 1929). We shall see 
that such symmetries may be used to generate dynamics, the gauge 
interactions. The prototype gauge theory is quantum electrodynamics. It is 
now believed that all fundamental interactions are described by some form of 
gauge theory. In the first section, after an introductory discussion of QED 
with its Abelian U(1) local symmetry, we study the fundamentally richer 
systems of non-Abelian gauge theories, the Yang—Mills theories (1954). After 
an elementary geometric look at gauge invariance, we present in the last 
section the subject of spontaneous symmetry breakdown in a gauge theory. 


8.1 Local symmetries in field theory 
Abelian gauge theory 


As we have already stated, QED is an Abelian gauge theory. It is instructive 
to show that the theory can actually be ‘derived’ by requiring the Dirac free 
electron theory to be gauge invariant and renormalizable. 

Consider the Lagrangian for a free-electron field W(x) 


Ly = Piy" ð, — m)p(x). (8.1) 


Clearly it has a global U(1) symmetry corresponding to the invariance of the 
theory under a phase change 


(x) > Yx) = eyx) 
W(x) > YŒ) = W(x). (8.2) 


We are going to turn this symmetry into a local symmetry, i.e., ‘to gauge the 
symmetry’ by replacing « with a(x). Thus we are going to construct a theory 
which will be invariant under a space-time dependent phase change, 


W(x) > W(x) = E(x). 
W(x) > W(x) = PO). (8.3) 
The derivative term will now have a rather complicated transformation 
W(x) OW(x) > Wr) Oh (x) = W(x) a 4,(e7 OAY) 
= W(x) ôy) — P(x) A,a(x)W(x). (8.4) 
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The second term spoils the invariance. We need to form a gauge-covariant 
derivative D,,, to replace 0,,, and D,,(x) will have the simple transformation 


D(x) > [DW] = cD, W(x) (8.5) 


so that the combination W(x)D,,W(x) is gauge invariant. In other words, the 
action of the covariant derivative on the field will not change the 
transformation property of the field. This can be realized if we enlarge the 
theory with a new vector field A,(x), the gauge field, and form the covariant 
derivative as 


Dw = (ô, + ieA,W (8.6) 


where e is a free parameter which we eventually will identify with the electric 
charge. Then the transformation law for the covariant derivative (8.5) will 
be satisfied if the gauge field A,(x) has the transformation property 


A(x) > Ai(x) = A,(x) + = 0,02) (8.7) 


From (8.1) we now have 


Lo = piy (ô, + ieA DY — mp. (8.8) 


To make the gauge field a true dynamical variable we need to add a term to 
the Lagrangian involving its derivatives. The simplest gauge-invariant term 
of dimension-four or less (with a conventional normalization) is 


t= —4F pF” (8.9) 
where 
Fy = 0,A, — 0,A,. (8.10) 


uv 


By direct substitution of (8.7) we see that F„, is in fact gauge invariant by 
itself. It is useful to see this in another way—the antisymmetric tensor F, is 
related to the covariant derivative as 


(D,D, — D,D,) = ieF „Y. (8.11) 


From (8.5) is it easy to see that 


[(D,D, — D,D,)w =e""[(D,D, — D,D,)y] (8.12) 

Or 
Fil! = (FyyWye™ (8.13) 

or 
| ee cee (8.14) 


Combining (8.8) and (8.9) we arrive at the QED Lagrangian 


L = Piy"(G, + ied Y — mp — 4LF F”. (8.15) 
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The following features of (8.15) should be noted 


(1) The photon is massless because a 4,4” term is not gauge invariant. 

(2) The minimal coupling of photon to the electron is contained in the 
covariant derivative Dw which can be constructed from the transformation 
property of the electron field. In other words, the coupling of the photon to 
any matter field is determined by its transformation property under the 
symmetry group. This is usually referred to as universality. Other (higher- 
dimensional) gauge-invariant couplings such as wo,,WF*” are ruled out by 
the requirement of renormalizability. 

(3) The Lagrangian of (8.15) does not have a gauge-field self-coupling, 
because the photon does not carry a charge (or U(1) quantum number). 
Thus, without a matter field, the theory is a free-field theory. 

We shall see that the first two features will still hold for non-Abelian 
gauge theories but the last will not. The presence of gauge-field self- 
coupling will make such non-Abelian theories highly nonlinear and will give 
rise to a number of fundamentally distinctive properties. 


Non-Abelian gauge symmetry— Yang-Mills fields 


In 1954 Yang and Mills extended the gauge principle to non-Abelian 
symmetry. (For subsequent development of the Yang—Mills theories see 
Utiyama 1956; Gell-Mann and Glashow 1961.) We shall illustrate the 
construction for the simplest case of isospin SU(2). 

Let the fermion field be an isospin doublet, 


Yı 
= . 8.16 
: iq sale 


Under an SU(2) transformation, we have 


—it:8 
2 


W(x) > W(x) = exp lyo (8.17) 


where t = (T4, T2, T3) are the usual Pauli matrices, satisfying 


Ti Tj i,“ he 
E a sity  ijk=1,2,3 (8.18) 
and 0 = (01,02,03) are the SU(2) transformation parameters. The free 
Lagrangian 

Lo = Yxi" 6, — m)p(x) (8.19) 
is invariant under the global SU(2) symmetry with {0;} being space-time 
independent. However under the local symmetry transformation 


W(x) > w(x) = UO) (8.20) 
with 


(8.21) 


U(0) = exp pay. 


2 
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the free Lagrangian Y, is no longer invariant because the derivative term 
transforms as 


W(x) OW) > W(x) d(x) = Wx) Ay) 
+ YAU *(8)[6,U(@)]W(x). (8-22) 


To construct a gauge-invariant Lagrangian we follow a procedure similar to 
that of the Abelian case. First we introduce the vector gauge fields A}, 
i=1,2,3 (one for each group generator) to form the gauge-covariant 
derivative through the minimal coupling 


2 
DW = (a, — ig 


where g is the coupling constant in analogy to e in (8.6). We demand that 
D,w have the same transformation property as y itself, 1.e. 


DW > (D,Y) = U(@)D,Y. (8.24) 


ty (8.23) 


This implies that 


(a, - ig“ *\ ue = voa, ~ig- oy (8.25) 
or 
2.00 -ig A neey =- 
or 
CA; 
= = V0) An y- 0) — <1, ONO (8.26) 


which defines the transformation law for the gauge fields. For an in- 
finitesimal change (x) « 1, 
0 
U(d) 1 -i~ ne (8.27) 


and (8.26) becomes 


T'A, T'A, te T 
= — ii Ak i -ô 
2 2 E at A ) 


Bo ins 1 /t 
= = pe + emcolat I) 


Or 
re ee ee 
Ay = Ay + OAL — 0,8 (8.28) 


The second term is clearly the transformation for a triplet (the adjoint) 
representation under SU(2). Thus the A/s carry charges, in contrast to the 
Abelian gauge field. 
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To obtain the antisymmetric second-rank tensor of the gauge fields we can 
follow (8.11) and study the combination 


e 
with 
tF tA, T'A, .|t°A, TA, 
T Ô, 5 0 o| AE l (8.30) 
or 
Fi, = ô, Ai — ôA; + ge ALA‘. (8.31) 


From the fact that D,w has the same gauge transformation property as w, we 
see that 


[(D,D, ig D,D DYT Tr U(0(D,D, m D,D,)w. (8.32) 
Substituting the definition (8.29) of Fi, on both sides of (8.32), we have 
t'F) UO) = U(O)t FW 


or 
t: F, = U(@)(t-F,,)U~*(6). (8.33) 
For the infinitesimal transformation 6; « 1, this translates into 
Fi, = Fi, + OFE. (8.34) 


Unlike the Abelian case, Fi, transform nontrivially, like a triplet under 
SU(2). However the product 


tr{(t: F(t FA) oc FF 


is gauge invariant. 

We can summarize the above discussion by displaying the complete gauge- 
invariant Lagrangian which describes the interaction between gauge fields A’, 
and the SU(2) doublet fields 


= LFF! + jiy D,y — mb (8.35) 
where 
Fi, = ôA, — 0,Ai, + ge" Ai AY (8.36) 
“A 
D,W = (a, EA ; “y. (8.37) 


The SU(2) gauge transformations of fields are 
; T° Ox) 
W(x) > W(x) = exp — get W(x) = UW) (8.38) 
T'A, T'A, 
2 2 


~ U(6) (Ju = : [a,U(6)]U-1(6) (8.39) 
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with infinitesimal forms 
t0 
KA SEY (8.40) 


Ai => AË = Ai + e0 AE — 5 oo (8.41) 


Generalization to higher groups and arbitrary representation for w is 
straightforward. Let G be some simple Lie group with generators satisfying 
the algebra 


[F°, F°] = iC Fs (8.42) 
where the C’”s are the totally antisymmetric structure constants. W is 


supposed to belong to some representation with representation matrices T”. 
Thus 


ae ee es | Olaaey das (8.43) 
The covariant derivative is then 

Div = (0, — igT*Apy (8.44) 

and the second-rank tensor for gauge fields is 
Fi, = 0,A$ — 0,Aa + gC? ALAS (8.45) 
(T -F),, = 0,(T-A,) — 0,(T-A,) — ig[T-A,, T- Ay] (8.46) 
L = —4F1,F + Wiy"D, — my. (8.47) 
The Lagrangian is then invariant under the transformation of the group G 
W(x) > W(x) = UT Olx) = UO) (8.48) 


T-A, (x) > T-A (x) = U(0,)T-A,U~*(,) 
- + [8U (0U -0y (8.49) 
with the infinitesimal variations taking on the forms 
Wx) > VE) = WOR) = ITPA) (8.50) 
A(x) > A(x) = A%(x) + C%*O(x)AE(x) — - 3 0x). (8.51) 


The pure Yang-Mills term, —7F4,F", contains factors that are trilinear 
and quadrilinear in Aj, 
2 
—gCc* 0 ,ASA”™ A” a = CeCe Ab AS AM AS, (8.52) 
which correspond to self-couplings of non-Abelian gauge fields. They are 


brought about by the nonlinear terms in F4, (8.45), because the gauge fields 
A, themselves transform nontrivially, like the generators, as members of the 
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adjoint representation. While this is fundamentally different from (rather, 
richer than) in the Abelian case, the properties of universality and 
masslessness of the gauge fields remain essentially the same. We note that the 
number of massless gauge fields is equal to the number of generators of gauge 
symmetry. Concerning universality we should make two comments about the 
coupling strength. 


(1) In the case of Abelian gauge theories there is no restriction on the 
coupling strength between the gauge field A, and other fields. Thus the 
electron carries charge e and another particle can in principle carry any 
charge Ae with an arbitrary À (for instance 2 = z). In a non-Abelian gauge 
theory such as the SU(2) case considered above with a doublet w, the 
situation is more restrictive. If one tries, for example, to couple the gauge 
field to an extra doublet @ with a coupling Ag, the commutation relation 
(8.18) insuring gauge invariance gets rescaled and implies 47 = A or A = 1. 
Basically in non-Abelian theories the normalization of the generators are 
fixed by the non-linear relation of commutator, hence g cannot be scaled 
arbitrarily. 

(2) Can there be different gauge couplings associated with different gauge 
fields? If the group is simple, as just stated, there can be only one coupling 
constant. However if the group is a product of simple groups such as SU(2) 
x SU(3) where each set of generators closes under commutation and 
commutes with other sets, there will be an independent coupling for each factor 
group. 


8.2 Gauge invariance and geometry 


Einstein’s successful formulation of general relativity in 1916 unveiled a 
profound connection between gravitation and geometry. This discovery 
inspired Weyl! (1919, 1921) to incorporate electromagnetism into geometry 
through the concept of a space-time dependent (local) scale transformation. 
Namely, at a neighbouring point, a distance dx" away, the scale is changed 
(from one) to (1 + S, dx"), and thus a space-time dependent function is 
changed according to 


f(x) > (f+ (6, f) dx") + S, dx") = f+ [(0, + S,) fy dx". (8.53) 


Weyl tried to derive electromagnetism by requiring invariance under this 
local scale transformation and by identifying the scale factor with the vector 
potential: S, A,. His initial attempt was not successful. By 1925 modern 
quantum mechanics has emerged. Here a key concept was to identify the 
momentum with the operator (—10,), and the canonical momentum in the 
presence of electromagnetic field with (—10, + eA,). It was then realized that 
the correct identification of Weyl’s scale factor should be S,<+1iA,,, and that 
what would be required would be invariance of the theory under space-time 
dependent phase transformation, see eqn (8.3). However when Weyl (1929) 
finally worked out this approach he retained his original terminology of 
‘gauge invariance’, the invariance under a change of the scale, a change of the 
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gauge. For a concise history of the gauge field concept, the reader is referred 
to the lectures by Yang (1975). 

The framework for a proper geometrical discussion of gauge fields is the 
modern theory of fibre bundles. However such a differential geometry study 
would be beyond the scope of this presentation. It suffices for us to note the 
existence of a deep geometric foundation to the gauge field concept. Here we 
merely present an elementary geometric look at gauge invariance. We will 
show how gauge fields A, describe parallel transport in charge space with the 
curvature tensor being the field intensity F,,. 

We briefly review some of the basic geometric concepts for a curved space. 
First, there is the notion of parallel transport. In any space, to compare two 
vectors (or any tensors) at two different space points V,(x) and V,(x’), we 
must first move (parallel transport) V, from x to x’, 1.e. the two vectors must 
be in the same coordinate system before we can take their difference. Thus 
there are always two steps in such a comparison 


DV, = ôV, + dV, (8.54) 


where ôV, is the (apparent) change due to moving these two vectors to the 
same coordinate origin and dV, is their difference measured in the same 
coordinate system. The operational definition of parallel transport 1s such as 
to keep the vector, throughout the transport, at a fixed angle to the tangent of 
the trajectory. Clearly parallel transport is a trivial operation in flat 
(Euclidean) space as it does not introduce any change of the vector ôV, = 0, 
and covariant differentiation is simply ordinary differentiation 


DV# = dV" = (0,V") dx’ in flat space. (8.55) 


However, in a curvilinear system there will be an apparent change in such a 
translation, as the coordinate axes differ from point to point in such a system 
(i.e. the metric is position-dependent). For x andx’ infinitesimally separated 
by a distance dx*, we expect dV, to be linear in dx* and V" 


ôV! = —T4,V" dx? 
and 
OV, = Day, dx? (8.56) 


since 6(V"V,) =0. The coefficient I}, is the Affine connection or the 
Christoffel symbol and may be shown to be (some combination of) the 
derivative of the metric (and hence vanishes in a space with constant metric). 
The comparison of the original vector at x’ after parallel transport from x to 
x’ results in 


DV! = V(x) — [V + 6V*] 
= (ô, V” + T6 V”) dx? (8.57) 


where the combination in the parenthesis is the covariant derivative. This 
contrast between flat and curved spaces is illustrated in Fig. 8.1. 
Another important concept in non-Euclidean geometry is the curvature 


8.2 Gauge invariance and geometry 237 


tensor which can be best introduced through the notion of parallel transport 
of a vector around a closed path (see Fig. 8.2 for an illustrative comparison). 
Consider the apparent variation of the vector after being moved around a 
small parallelogram PPP, P} composed of two vectors a’ and b” (and their 
parallel displacements) (Fig. 8.3). Let ôV, and ôV, be the apparent changes 
along the paths PP,P, and PP,P,, respectively. Thus the total apparent 
change for a round trip is 


AV, = ôV, — ôv, (8.58) 

with 
ôV, = (I? V,a + (T° V,)p (b! + ôb) (8.59a) 
ôV, = (Tug Vy)b! + (Tia V,)p, (a + ôa”) (8.59b) 


We can expand the quantities evaluated at points P, and P, so that all 
tensors are measured at a common point P 


(Tig V, )r, =r (Tig + OnT upa) V, T Lsk Va"); (8.60) 
similarly for (I,.V,)p,. Substituting these results and 


a + da* = a — Tha b" (8.61) 


P P yN ; 
(b) 


\ 
\\6 
\ 
yey y“ 
(a) 


Fic. 8.1. Apparent changes induced by parallel transport (a) in flat space (no apparent change); 
and (b) in curved space. 


(a) (b) 


Fic. 8.2. Apparent changes induced by parallel transport (a) around a closed path 1-2-3-4 in 
flat space; and (b) on a spherical surface. 


P,  bf+ôbf P, 
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and a similar expression for (b? + ôbf) into eqns (8.59) and (8.58), we have 


AV, = Rag Vo". (8.62) 
The apparent change around a closed path is proportional to the vector itself, 
to the area (tensor) bounded by the path o% = a%b’, and to the curvature 
tensor 

Rig = Old yg — Opt a + Vig Ge — Vi g. (8.63) 
For the rather simple example of a spherical triangle with 90° at each vertex 
(Fig. 8.2), the apparent change is clearly 2/2. This agrees with eqn (8.62) as 
the curvature tensor reduces to a curvature R = 1/r? where r is the radius of 
the sphere. When multiplied by the area of the triangle zr7/2 one gets an 
apparent change of 7/2. 

A direct comparison of (8.57) with the gauge covariant derivative D,w 
(8.44) indicates that T: A, has a geometrical interpretation as the ‘connec- 
tion’ (i.e. Christoffel symbol) in the internal charge space. As in (8.56), under 
a parallel transport the field W(x) undergoes, because of the local change of 
axes, an apparent change 


W(x) > W(x + dx) = w(x) + d(x) 
with 
OW(x) = igT - A,w dx" (8.64) 
where T is the set of representation matrices of the symmetry generators and 
W is the basis vector. 


For parallel transport of a finite interval from x to x’ we can exponentiate 
(8.64) and obtain 


P(x’, x) = exp fi | T-A,(y) a} (8.65) 


x 


where the line integral is taken along a path joining x and x’. Thus for every 
path we can associate a group element. Let us check that such an 
interpretation is compatible with the transformation properties of gauge 
fields. For simplicity consider infinitesimal parallel transport (8.64) 


Wx + dx) = (1 + igT - A, dx")W(x). (8.66) 


Now for a different choice of frame at each point, we make a gauge 
transformation, a rotation of axes, at each point by T :@0(x) as in (8.48) 


Yx) > W(x) = U0) x) (8.67) 


and 
W(x’) > Yx’) = U (8, Yx’). (8.68) 
In order to keep the product w(x + dx)P(x + dx, x)W(x) invariant, parallel 


8.2 Gauge invariance and geometry 239 


transport ‘connection’ must transform according to 
1 + igT - Aj, dx" = U(0,44,)(1 + igT - A, dx")U~*(6,) 
= (U(0,) + 6,U(6,) dx”)(1 + igT - A, dx")U~*(6,) (8.69) 


or 
T: A(x) = U(6,)T-A,O~ *(6,) — : [0 U(0.)]U *(6,) (8.70) 


which is the required transformation property of gauge fields as given in 
(8.49). This result clearly holds also for finite separations. Thus the parallel 
transport operator (8.65) has the gauge transformation 


P(x’, x) > P(x’, x) = U(0,.) P(x’, x)U (0). (8.71) 


A direct comparison of T-F,, (8.46) with Rf, (8.63) suggests that T-F,,, 
be interpreted as the curvature of internal charge space. This can be checked 
explicitly by considering parallel transport around a closed path C. For 
simplicity we choose C to be a parallelogram with one corner at x, and two 
sides dx, and ox,. 


Po = P(x, x + dx) P(x + dx, x + dx + 6x) 
x P(x + dx + ôx, x + ôx) P(x + Ox, x) (8.72) 


where the Ps are the parallel transport matrices of (8.65). Using the matrix 
identity 


e44 eB — e4l4+B)+1*/2[A, B] $ o(A3), (8.73) 
we find 
P(x, x + dx) P(x + dx, x + dx + 6x) 
= expLigA,(x) dx"] exp[igA,(x + dx) 6x”] 


= exp{ig(A, dx" + A, dx” + 6,A, dx" ôx”) — A [A,, A,] dx" ôx} 
(8.74) 
and 
P(x + dx + dx, x + 6x) P(x + ôx, x) 
= exp[ —igA,(x + 0x) dx] exp[ —igA,(x) dx”] 
= exp{ —ig(A, dx” + A, ôx” + 0,A, dx" ôx”) — 4 [A,, A,] dx“ dx’}. 


(8.75) 
Thus (8.72) may be written as 


Po = exp{ig(0,A, — 0,A, — iglA,, A,]) dx" 6x’}. (8.76) 
In eqns (8.74)-(8.76) we have simplified the notation by writing T- A, as 4,. 
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The above calculation justifies the identification of 
T-F,, = 0,T-A, — ô, T-A, — ig[T-A,, T: Ay] (8.77) 


as the curvature tensor of the internal charge space. 

In summary, the essential point is that, in any (physical or internal) space 
where the coordinates are position-dependent, the significance of comparing 
two vectors (or any two tensors) at different points is lost. The standard way 
of dealing with this problem is to introduce the notion of parallel transport 
or affine connection. In the case of physical curved space-time the Christoffel 
symbol is introduced and in the case of internal charge space the gauge fields 
are introduced. They ‘compensate’ for the change of local frames at each 
space time point. 


8.3 Spontaneous breaking of gauge symmetry, the Higgs 
phenomenon 


We saw in §8.1 that the imposition of local symmetry implies the existence of 
massless vector particles. If we want to avoid this feature of the gauge theory 
and obtain massive vector bosons, the gauge symmetry must be broken 
somehow. If we introduce explicit breaking terms in the form of arbitrary 
gauge boson masses we alter the high-energy behaviour of the theory in such 
a way that the renormalizability of the theory is lost (see the discussion in 
§2.4). We may contemplate the possibility of spontaneous breaking of the 
symmetry, as discussed in §5.3. Thus, we have the situation of a hidden 
symmetry: the Lagrangian is still fully invariant under the symmetry 
transformations but the dynamics are such that the vacuum, the ground 
state, is not a singlet of the symmetry group. The choice of one from all the 
possible degenerate ground states as the physical vacuum breaks the 
symmetry. This spoils the usual symmetry consequence of energy-level 
degeneracies. But, according to the Goldstone theorem of §5.3, this would 
imply the existence of a set of massless scalar bosons. Thus either way it 
would seem that we run into undesirable massless particles. 

As it turns out the Goldstone theorem is evaded in gauge theories as the 
proof of the theorem requires the validity of all the usual field theory axioms: 
manifest Lorentz covariance, positivity of the norm, etc. There is no gauge- 
fixing condition we can impose for which a gauge theory obeys all the axioms 
of the usual field theories. In covariant gauges we have states of negative 
norm (longitudinal photons); in the radiation or axial gauges, we do not have 
manifest Lorentz covariance. If we regard the massless gauge bosons and 
massless Goldstone bosons as diseases of the theory, each turns out to be the 
cure of the other. They both disappear from the physical spectrum of the 
theory by combining to form massive vector particles, without ruining the 
good high-energy behaviour of the symmetric theory. This remarkable 
phenomenon was first suggested by Anderson (1958, 1963) who pointed out 
that several cases in nonrelativistic condensed-matter physics may be 
interpreted as due to massive photons. Particularly in superconductivity we 
have the phenomenon of magnetic flux exclusion (the Meissner effect) and 
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this corresponds to a finite-range electromagnetic field, hence a ‘massive 
photon’. The extra longitudinal component is in fact coupled to the collective 
density fluctuation of the electron system—the plasma oscillation. The 
proper generalizations to relativistic field theory were carried out by Englert 
and Brout (1964), and by Guralnik et al. (1964), and more completely, by 
Higgs (1964a, 1966). In the literature it is commonly referred to as the Higgs 
phenomenon. °t Hooft (1971b) first showed that gauge theories are re- 
normalizable even in the presence of spontaneous symmetry breakdown. 


Abelian case 
Consider the simple case of Abelian U(1) gauge theory. 
L = (D.o) D") + p'o — Apor — Fy F” (8.78) 
where 
Did = (6, — 1gA,) 
F,, = 0,A, — 0,A,,. (8.79) 
The Lagrangian is invariant under the local gauge transformation 


P(x) > G(x) = epa) 


l 
A ,(x) > 4x) = A,(x) — 7 ô,a(x). (8.80) 
When x? > 0, the minimum of the potential 
Vlo) = -K p'o + Ab")? (8.81) 
is at 
|p| = v/ 2 (8.82) 
with 
v = (p?/d)?. (8.83) 
This means that the field operator @ develops a vacuum expectation value 
[COIP] = v/ V2. (8.84) 
If we write @ in terms of the real fields ¢, and ¢, 
] 
p= V2 ($, + ig2), (8.85) 
we can choose 
<0|o;|0> =v and <O|p,|0> = 0. (8.86) 


Thus, the Lagrangian (and the potential) have U(1) symmetry and the 
minimization can only fix the modulus of @. To pick one as in (8.86) out of 
this infinite number of possible minimum values as the physical vacuum 
breaks the symmetry. (This example is essentially the one given in §5.3 witha 
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change of notation: o > ¢, and z > @,.) Thus with the shifted fields 
d,=9,—-v and $,= 9, (8.87) 


we would conclude that #5 corresponds to the massless Goldstone boson. 
The added feature of the case at hand is of course that we have a local gauge 
symmetry. The ordinary derivative is replaced by the covariant derivative. 
That term will yield 


|D,,0I? = (ô, 7 igA,) ol? 
= (6,61 + JAP) + Op — GA uh 1) 


212 
— gvA"(3,6', + JAQ) + a AMA, (8.88) 


The last term can be interpreted as a mass term for 4,. Thus the gauge boson 
acquires a mass M = gv. 


Unitary gauge (Abelian case). The presence of the term 
gvA* 0,05 (8.89) 


in eqn (8.88) will bring about a mixing between A, and #4 to make this 
interpretation less clear. To remove this mixing term, we will parametrize the 
complex field in polar variables and shift only the modulus field 


l 
(x) = V2 


l . 
= WA [v + n(x) + ičé(x) + ...]. (8.90) 


Thus, for small oscillations, n(x) and €(x) are really ${(x) and @;(x), 
respectively. The free Lagrangian also keeps the same form 


Lv + n(x)] exp(ic(x)/v) 


] 2 
Ly = 5 OM? — 0l- 5 O + ). (8.91) 


The canonical quantization conditions are not changed; n(x) and €(x) have 
the same particle interpretation as @, and @). 

We can now remove the unwanted term (8.89) by transforming @3(x) or 
&(x) away or, more accurately speaking, by fixing the gauge (the unitary 
gauge). To do this, we define new fields 

u g l 
p(x) = exp(—1¢/v) (x) = wD 


He = sb ce: (8.92) 
gu 


(v + n(x)) 


From the property of gauge transformation (8.80), we have 
D,ọ = exp(—1¢/v)(0,$" — igB, ¢") 
= exp(—1¢/v)(0,n — 19B,(v + n))/ V2 
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and 
ID, P|? = 216,n — igB,(v + n)? (8.93) 
and also 
Fy = 0,B, — 0,B,. 
The Lagrangian of (8.78) may be written 


l . u’ À 
=; lð n — igB,(v + n)? + oe ny — gut n)* 


1 
4 (0, B, = ôB,” 


== Lo F fi 
with 


l 1 ] 
Lo = 5 (Gun) — pn? — 7 (GyBy — By)” +5 (gv) BB" 


l 
ZJ = 59° B,B'n(2v +) — Av*n? — j Ant. (8.94) 


It is clear that Y is the free Lagrangian density for a massive vector boson 
with mass M = gv and a scalar meson with mass m = ./2u. The field €(x) has 
disappeared from the Lagrangian. This may be less surprising when we count 
the degrees of freedom. Before spontaneous symmetry breaking, we had two 
scalar fields @, and ġ, and one massless gauge boson A, (with only two 
polarization states). After the symmetry breaking, we have only one scalar 
field y and one massive gauge boson B, (with three polarization states). Thus 
the massless gauge field A, combines with the scalar field č to become a 
massive vector field B, in (8.92). This is the Higgs mechanism for the Abelian 
case. The €(x) field is called a would-be-Goldstone boson. 


Non-Abelian case 


It is straightforward to generalize the Higgs mechanism to theories with non- 
Abelian gauge symmetry. Consider the case of an SU(2) gauge theory with a 


Pr 
$ 


2 


complex doublet of scalar fields @ = 
L = (D ADG) — V(b) — FF” (8.95) 
where 
T 
D, = (o, — 1g 5) A) 
F4, = 0,A$ — ô Ag + ge ALAS 
Vp) = —W°(G"'d) + AG"9)?. (8.96) 
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For u? > 0 the classical potential is a minimum at 
<o'b)y = v?/2 with v = (WÈ. (8.97) 


We can choose the physical vacuum corresponding to the expectation value 
having the form 


1 /0 
<P)o = W (C) (8.98) 
If we define the new field 


' = $ — <$>o; (8.99) 


then <¢ġ'o = 0. The covariant derivative term will generate a mass for the 
vector boson field since 


, 
(D,6)'(D,4) = (6, -ig sao + o) 


x (( -i95 arjo $ <6) (8.100) 
contains the factor 
PONE AeA. = 3 (2) AA (8.101) 
corresponding to A, having a mass 
i= (8.102) 


In the scalar sector, we have 


p'o = p7p + <b 0d’ + P lpo + <P 0H> 0 


(HP = pp + (Kb"oh!’ + p H)0)* +... (8.103) 
Writing ọ' = kal the term quadratic in ġ’ is 
2 
Av* 7 1t\2 u’ , 1 ty\2 
=a Ort On) aia VATE) (8.104) 


This means that only the combination (63 + ¢4')/./2 is massive (physical 
Higgs particle). The other three states ¢), ¢', and (¢5 — ¢4,')//2 are the 
would-be-Goldstone bosons, which will combine with the original three 
massless gauge bosons to become three massive vector bosons. 


Unitary gauge (non-Abelian case). To see this explicitly we go to the unitary 
gauge. We parametrize the scalar doublet 


0 
z T v + n(x) 
o(x) = exp] x xa ( 72 ) (8.105) 
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where <é*>9 = “n>, = 0. We can then define the new fields by 


l 0 
"(x)= U = —~ 8.106 
$'a) = UAD = (, | , (8.106) 
T T z i _ 
a Pe UO A Oe OOO) eID 
where 
U(x) = api Eto} (8.108) 
From the properties of the gauge transformation, we obtain 
D,¢@ = U~'(x)D,¢" (8.109) 
Foo = GG” (8.110) 
where 
h T ul 
Do" = (2,-iv5-B,)¢ (8.111) 
G5, = („B$ — 0,B4 + ge” Bi BS). (8.112) 


Then the Lagrangian density in the unitary gauge is simply 
À l 
L = (D pY (DG) + 5 (+n) -Fet -36aG™. (8.113) 


The first term contains a term quadratic in B, 


2 2 
g 0 1 /gv 
g (0, v)(t ` B, T° Bo?) = 7 (2) B_B’. (8.1 14) 
We have vector particles with mass Mpg = gv/2. Thus the original SU(2) 
gauge symmetry is completely broken; all three gauge fields acquire mass. 


Pattern of symmetry breaking 


It is important to keep in mind that the pattern of symmetry breaking is not 
arbitrary but depends on the structure of the theory in particular the (group) 
representation content of the scalar field (Kibble 1967; Li 1974). For 
example, if we have a triplet of real scalar fields @ instead of the complex 
doublet, the gauge symmetry SU(2) will be broken down to a residual U(1) 
gauge symmetry with one massless vector boson remaining. To see that this is 
the case, start with the scalar potential 


V(b) = -Kp + Ao. (8.115) 
Again, minimization of V(®) only determines the magnitude 


Kbyol = 0//2 with v= (w2/A)?. (8.116) 
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We are free to choose the vacuum state so that 


0 
] 
(bo = 5 0 |. (8.117) 
v 


The ground state ® points in the 3-direction, the symmetry is spontaneously 
broken. The covariant derivative term 


[D.o]? = [0,6' — iglo + <)o) x Ay] (8.118) 


will not contain any term quadratic in A>. Thus the A? field continues to 
describe a massless vector boson. We note that this pattern of symmetry 
breaking is related to the fact that <@), of (8.117) is still invariant under an 
O(2), i.e. a U(1), rotation in (1, 2) space. 


The number of massive gauge bosons 


Since the number of massless gauge bosons corresponds to the number of 
generators of the (unbroken) gauge symmetry group, the number of gauge 
bosons that become massive (or the number of would-be-Goldstone bosons) 
is equal to the difference in the number of generators of the original 
symmetry and of the final symmetry. We shall present a proof of this 
statement; this also will provide us with a chance to introduce some general 
formalism. 
Consider the general Lagrangian density 


L =35[(0,0; + igT{, Aid, Ep: — igT4,A";,)] 
— Veh) — Fi F” (8.119) 


where @; is a set of real fields, transforming according to some (possibly 
reducible) representation of the gauge symmetry group G with n generators 


Qix) > P(x) = Qix) + 1e(x)TEO(X), a=1,2,...,n. (8.120) 


Given that the potential in (8.119) is invariant under an arbitrary group 
transformation (if the potential is invariant under a larger group, there will 
be scalars which become massive only through radiative corrections. They 
are often referred to as the pseudo-Goldstone bosons (Weinberg 1972b)), 


OV OV 
0 = ôV = — 6¢, = & — Tio, 
ag, OO ag, 
or 
OV 
a6, 18 j—Y, cn ne 7 (8.121) 
Differentiation gives 
ôV OV 
sen Eth a) 
ab: Ob; i? + 3G “= O (8.122) 
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If V is a minimum at ¢; = v;, then the second term in (8.122) vanishes, and 
07V 

0p; OP, 

We should remark that, in the global symmetry case, (8.123) corresponds to 

the statement that a number of scalars are massless. That this second 


derivative matrix corresponds to a mass matrix can be seen by an expansion 
of V(@;) around the minimum 


a a, 
Lt 
Qi=r, 


=.) (8.123) 


1 yV 
V(d;) = Vw) + - ———— -— 0; — y Tot 8.124 
(ġ;) (v;) F > 0d; 0d, Pa (Qi Uv; (Dx Uy) zE ( ) 
Thus the mass matrix 1S 
07V 
(M°) = ———— (8.125 
Mt 56, Oa loa 


Now suppose G has a subgroup G’ with n’ generators that leaves the vacuum 
invariant 


T;v,=0 for b=1,2,...n' (8.126a) 
and 
Tav AO for c=n +l,...,n. (8.126b) 


If we choose T° to be linearly independent, eqns (8.126), (8.125), and 
(8.123) clearly imply that M? has n — n’ zero eigenvalues and hence (n — n’) 
Goldstone bosons. In the gauge symmetry case, these (n — n’) massless states 
correspond to the (n — n’) would-be-Goldstone bosons. In preparation for 
gauging them away, we re-parametrize @; by 


Qi = exp{i T; é (x)/v}(v; + n;(x)) (8.127) 


where c = 1, 2,..., (n — n’), 1.e. we sum over the broken generators. v in the 
exponent is the magnitude of v;. The n;(x)s are the remaining scalar fields 
which are orthogonal to é°s. After a gauge transformation. A, > A, with 
gauge function (x) = —iT°é‘(x)/v. The Lagrangian has a quadratic term in 
As 


2 
-5 (Tv, Tv) AeA, (8.128) 


After diagonalization this leads to (n —n’) massive vector bosons. To 
summarize, the number of would-be-Goldstone bosons is equal to the 
difference in the number of generators of the original and the final gauge 
symmetries. 

In this section we have chosen to fix the gauge so that the particle content 
of the theory is obvious (the unitary gauge). In the next chapter, at the 
end of §9.2, we shall also discuss another class of gauge choices (the 
renormalizable gauge or the R gauge) where the would-be-Goldstone bosons 
are not eliminated explicitly but the gauge-field propagators manifestly have 
good high-energy behaviour and the renormalizability of the theory is more 
transparent. 


Quantum gauge theories 


WE now proceed to quantize the gauge theories, explain their perturbative 
solutions, and discuss the generalized Ward identities of such theories. 


9.1 Path-integral quantization of gauge theories 


Gauge theories, being gauge invariant, represent systems with constrained 
dynamical variables, i.e., there are variables that do not represent true 
dynamical degrees of freedom. The quantization procedure of such theories 
is more involved than that for the scalar field theory discussed in Chapter 1. 
For gauge theories the path-integral formalism provides the most direct 
quantization procedure. 


Difficulties of gauge theory quantization 


We are already familiar with the problem of quantizing the electromagnetic 
field A,,(x). In the canonical formalism one identifies the canonical variables 
A,(x) and their conjugate momenta n,(x) = 6L(x)/d(09A"(x)) as operators 
and postulates their commutation relations. One immediately discovers that 
To(x) and V - x(x) vanish, which implies that A(x) and V - A(x) commute with 
all canonical operators. They are really c-numbers. The four-vector field 
A,(x) actually represents only two independent dynamical degrees of 
freedom. The canonical commutation relations for these transverse fields 
A (x) and 1, (x) have to be formulated so that they are compatible with the 
above-mentioned constraints. For example, we can take the constraint in the 
form of V- A(x) = 0 (radiation gauge) or A3(x) = 0 (axial gauge). In such 
formulations one sacrifices manifest Lorentz covariance. Alternatively, one 
maintains explicit Lorentz covariance and introduces spurious degrees of 
freedom into the theory. This brings about a Hilbert space with indefinite 
metric (the Gupta—Bleuler formulation). A physically sensible theory is 
recovered only after we restrict the admissible states to those satisfying (the 
Lorentz gauge) 0"A,|¥> = 0. The key point in all these formulations is that 
one must remove the redundant degrees of freedom (resulting from gauge 
invariance) of the theory by some acceptable gauge-fixing conditions. In the 
language of path-integral quantization formalism, one must restrict the 
functional integration to reflect these gauge-fixing conditions. A consistent 
implementation of such constraints for non-Abelian theories is a highly 
nontrivial matter, and the problem was finally solved through the work of 
Feynman (1963), DeWitt (1967), Faddeev and Popov (1967), and many 
others (Mandelstam 1968; Popov and Faddeev 1967; Veltman 1970; ’t 
Hooft 1971a, b). 
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We shall restate the difficulty of quantization directly in terms of the path- 
integral formulation of gauge theories. To be specific, we consider the case of 
SU(2) Yang-Mills fields 


Paak FO, a= 1, 2,3 (9.1) 
with 
F, = 0,48 — 0, AS + ge ALAS. (9.2) 


If we write the generating functional as 


WJ] = | [dA,] exp} | d+x[ F(x) + ZORTE (9.3) 


the free-field part is then 


W [J] = fraa, exp} [asz + J,,(x) aroo} (9.4) 
with 


l 
| dix A(x) = E [aaas — 0 Ap )(0" A7 — 0” A) 


5 | dx A(x) (g 8? — " OAA). (9.5) 


Now we have a situation very similar to the scalar field theory and we would 
like to proceed and perform the Gaussian integration as in eqns (1.79) and 
(1.81) 


[a exp[ —2¢¢Ko> + <Jb>] ~ exp(JK~'J>. (9.6) 


l 
vdet K 
However this is not possible, because the operator 

Kv = Ju © — 6, 6, (9.7) 


in (9.5) does not have an inverse, as we shall demonstrate below. 
Assuming G’“(x — y) is the inverse of K,,, 


(Guy © — ô, 0,)G*(x — y) = g; +(x — y). (9.8) 


Using the Fourier transform 


G**(x) = d*k e7 “GrA(k) (9 9) 
-J$ 
we have 
(—k*g,, + k,k,)G*(k) = gi. (9.10) 


With the invariant decomposition 


GHk) = alk?) g’* + b(k2)k°k?, (9.11) 
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it is clear that the left-hand side of eqn (9.10) = —a(k*)(k?gi — k,k*) cannot 
be equal to the right-hand side. Thus K,,, does not have an inverse. Another 
way to see this is that the operator K,, given in (9.7) satisfies the relation 
K,,,(x)K}(x) oc K,,(x). This means that it is a projection operator (which 
projects out the transverse degree of freedom of the gauge field) and clearly 
does not have an inverse. Equivalently, for the case det K vanishes and eqn 
(9.6) is not applicable. This singular nature of the path integral, 1.e. this extra 
infinity, is related to the gauge invariance of the theory. In eqns (9.3) and 
(9.4) we have summed over all the field configurations, including ‘orbits’ that 
are related by gauge transformations. This overcounting is at the root of the 
divergent functional integral. We need to seek a prescription to divide out 
this infinite (functional) volume of the orbit. To quantize a gauge theory, it is 
necessary to fix the gauge. 


Isolating the path-integral volume factor 


(1) A two-dimensional case as illustrative example. Before launching into the 
actual calculation that will isolate this volume factor from the functional 
integration (hence in infinite-dimensional space) we shall use a two- 
dimensional integral to illustrate our strategy 


W = [ax dy eS 


z | dêr 5) (9.12) 
where r = (r, 0) is the label in the polar coordinate system. S(r) is supposed 
to be invariant under a rotation in two-dimensional space 

S(r) = S (rẹ) (9.13) 
for 

r= (r, 0) > r = (r,0 + 9). (9.14) 


Thus S(r) is a constant over the (circular) orbit. In this simple case if we only 
wish to sum over the contribution from the inequivalent S(r)s we can simply 
divide out the ‘volume factor’ corresponding to the polar angle integration, 
f d0 = 2x. To do this we adopt the following procedure which can be 
generalized to more complicated situations. First we insert 


l = [as ô(0 — ¢) (9.15) 
into the original expression for W 
W = [ae {ar e 6(0 — p) = [aow, (9.16) 


where W, = f dr (0 — ) ec is evaluated for a given angle p. Thus, we first 
calculate W along a fixed angle 0 = @, then integrate over the contributions 
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c beg © g(r) =0 
(a) (b) 


Fic. 9.1. 


for all values of @ (see Fig. 9.1(a)). Using the invariance property of S in 
(9.13) we have 


Thus the volume of the orbit can be factored out 


= 2n Wp. (9.18) 

Generally a constraint that is more complicated than 0 = @ may be chosen, 
and we represent this by 

g(r) = 90 (9.19) 


which intersects each of the orbits once as shown in Fig. 9.1(b), i.e. the 
equation g(r,) = 0 must have a unique solution ¢ for a given value of r. For 
this general constraint (9.19), instead of the simple eqn (9.15), we need (to 
define) a function A,(r) such that 


[4,0] = | do ol g(t,)]. (9.20) 
Hence 
A,(r) = “a K (9.21) 


and A,(r) is itself invariant under the two-dimensional rotation (9.14) since 
[Arg] * = fas OLg(tg+¢)] 


= [ao OLg(ty)] 


= [4,0] }. (9.22) 
Repeating steps (9.16)—(9.18) the volume factor in W can then be isolated 


W = | dow, (9.23) 
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with 
W, = [a e A (r) dL g(r4)]- (9.24) 


W, is rotationally invariant 


Wy = | dre A (r) SLolty)] 


_ | dr’ e™ A (r) SLg(t)] 


= Wy. (9.25) 


where we have introduced the variable r’ = (r, ġ') and have used the fact that 
S(r), A,(r), and the integration measure dr, are invariant under rotations. 
Thus to remove the ‘volume factor’, we can insert a constraining 6-function 
and multiply it by a function A, defined by (9.20). 

(2) The PI volume factor in gauge theories. We now return to the task of 
isolating the actual volume factor in the functional integration of the 
generating functional in gauge theory. The procedure will be exactly the same 
as for the simple case we have just discussed. The action is invariant under 
the gauge transformation 

Ay Al 


where 
A’ 2 VOJA, "t/2 + = U~'(6) 0,00) [UO 


with 
U(@) = exp[ —10(x)- 1/2]. (9.26) 


Os are the space-time dependent parameters of the group. The ts are the 
Pauli matrices. Thus the action is constant on the orbit of the gauge group 
formed out of all the Afs for some fixed A, with U(6) ranging over all 
elements of the group SU(2). A proper quantization procedure must restrict 
the path integration to a ‘hypersurface’ which intersects each orbit only once. 
Thus, if we write the equation for the hypersurface as 


fAA,)=0 a=1,2,3, (9.27) 


then the equation 
JAA) = 0 (9.28) 


must have an unique solution @ for a given A,. Eqn (9.27) is clearly a gauge- 
fixing condition. 

We also need to define the integration over the group space. Let 6 and 6’ be 
elements of an SU(2) group. In terms of the representation matrices U(@) the 
multiplication of group elements takes on the form 


U(0)U(6') = U(66'). (9.29) 
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In the neighbourhood of identity, we can write 
U(0) = 1 + ið -1/2 + O(07). (9.30) 


The integration measure over group space can be chosen as 
3 
[dé] = [| d0, (9.31) 
a=1 
which is invariant in the sense that 


d(06’) = dé’. 


We can now isolate the desired volume factor by defining a function 
A,[A,] 


A; ‘[A,] = [raa OL SAA) I- (9.32) 
Thus 
A,;({A,] = det M; (9.33) 
where 
_ Sa 
Mpa = 56, (9.34) 


Thus, M, is just the response of f,[A,] to the infinitesimal gauge 
transformation. More precisely, from (9.30), the infinitesimal gauge trans- 
formation is of the form 
1 
AP = ACen A, = ; 0 ,0° (9.35) 


u 


and the response of f,[A,,] is 


SaLARCO] = fLA,(x)] + [asem s, YYlaða(y) + O(87). (9.36) 
Because of the requirement that eqn (9.28) have a unique solution, (det M+) 
does not vanish. 


A,{A,] has the important property that it is gauge invariant. To see this, 
we write (9.32) as 


A; '[A,] = [raro OL f(A, I: (9.37) 


then 


A; [A] = [raven OL fa(An (x) 
z | [d(O(x)6'(x))] SL (AR? (x))] 


= | [do"(x)] OL (AN (x))] 


= A-'[A,]. (9.38) 
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We now substitute (9.32) into the path-integral representation of the 
vacuum-to-vacuum amplitude 


[aa epi; [asgo] = [[a9oo1taa,o0 A,[A,] 
x OL f,(Aj)] exp fi [aszw} 
= | [d0(x)][dA,(x)] A LA] 


x OL f,(A,)] xpi [iszol 
(9.39) 
To arrive at the last line we used the fact that both A,[A,] and exp{i f d*x 
Y(x)} are invariant under a gauge transformation Aj, > A,. Now, the 
integrand is independent of 6(x) and the integration over II, d@(x) is the 
infinite orbit volume we have been seeking to identify. This suggests that the 


prescription for the generating functional of the gauge field A, (after 
applying eqn (9.33) and eqn (9.39)), should be 


W [J] = | [dA,](det M,) ôL/(A,)] exp | d4x[ L(x) + ary 
(9.40) 


This is the Faddeev—Popov ansatz (1967). In other words, we can get rid of the 
unwanted redundancy in the quantization procedure by restricting the 
functional measure with det|d//d6| ôL f(A,)]. 


Consistency check of the FP ansatz in axial gauge 


Before proceeding further with the formalism we shall make an elementary 
check of the FP ansatz with a specific example. Consider the following choice 
of the gauge-fixing condition (9.27), the axial gauge (Arnowitt and Fickler 
1962) 


f,=A5=09. (9.41) 
Under the gauge transformation (9.35) we have for (9.36) 


1 
JAA!) = AS + e™0AS — ; 046 
1 a 
=- 050 (9.42) 


because of (9.41). Thus we have the response matrix M; = (—1/g) 03 On, 
which is independent of the gauge field. For this choice of the (axial) gauge 
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we can therefore ignore the (det M,) factor in W,[ J] 


W [J] = | [dA,,] 6(A3) expfiS[J]} (9.43) 


S[J] = [asien aa | 


It is more convenient to work with an alternative form of the generating 
functional 


WLI] = [tar araa, 0(A3) exp{iS’[J]} (9.44) 
with 
S'[J] = [aster + 4F"'(0,A% — 0,A2 
+ ge ABAS) + JEA"). (9.45) 
If we integrate over Fiv, W-[J] reduces to W,[J] of (9.43). 


pv? 


Let us check the compatibility of the FP formulation eqns ((9.43) and 
(9.44)) with the canonical quantization, to see whether it does restrict the 
functional integration to the same dynamical variables as deduced with the 
canonical procedure. 

We first identify the independent canonical variables in the axial gauge 
A$ = 0. The Lagrangian in (9.45) becomes 


L! = UFa) + iF” (0 AS — 0,47 + ge APA) 
+ FOG, A} — 6,46 + ge AGA) 
a F534 — 603A?) a | cau | —03A6) (9.46) 


where i,j = 1,2. 
The Euler-Lagrange equations 
Of" Of" 
tae H (9.47) 
ÒO Fi) oF hy 


and 
pOL" Oe 
6(0*A%) ôA! 


(9.48) 


give rise to the following constraint equations (having no time derivatives) 
a a a abc 4b 4c 
Fiz = — 034i 
03 = — 0340 


OFS. — 63F%, = — ge™ F? A" (9.49) 
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and to the following dynamical equations 
6; = Oo Af — 6,;A6 + ge AX AS (9.50a) 
OMFG, = —ge? (FR A" + Fb;A%). (9.50b) 


Thus 40, Fi, Fis, and Fò are constraint variables; they can be eliminated 
from S’ (resulting in S”) by the constraint equations (9.49) in terms of the 
remaining variables 4f and Fĝ:. This identification of independent canonical 


variables leads us to construct the generating functional 


WLI] = | (ao, 2CaosIC4A, JaAn] exptiS”[J]}. (9.51) 


Our consistency check of the FP ansatz now consists in showing the 
equivalence of this functional integral to that in (9.44) and thus (9.43). We 
need to show that, if a dynamical variable appears at most quadratically 
(with a constant coefficient) in the action, then integrating over the variable is 
the same as eliminating it from the action by the Euler-Lagrange equation. 
This is indeed the case and we can illustrate this theorem as follows. Consider 
the functional (Gaussian) integral 


| [do] exptiS[¢]} {a0 expli [aae i ooo] 


apl- | d+ xf roo? (9.52) 
On the other hand, the Euler-Lagrange equation from S in (9.52) yields 
ag(x) + f(x) = 0. (9.53) 


Thus eliminating $(x) in S, we have 
l 
= -a [O (9.54) 
2a 
which is the same as eqn (9.52). This also completes our demonstration that 
the FP ansatz indeed provides the correct restriction (i.e. the same as 
canonical procedure) on the integration measure. As we illustrated in the 


introduction to PI formalism in Chapter 1, the Hamiltonian PI formalism 
where 


WEJ] ~ | [dd dz] exp | d*x[ 209 — #(n, p) + m (9.55) 


is equivalent to the Lagrangian PI where 


WEJ] ~ [ree exp [ost ôo) + sei} (9.56) 


It is not difficult to check that (9.51) is the Hamiltonian formulation with Fô; 
being the transverse canonical momenta. 
Because we can drop the FP determinant with this choice of axial gauge, 
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the quantization is particularly simple. But in this gauge we lose manifest 
Lorentz invariance and Feynman rules are complicated. 


Abelian gauge theory 
We should remark that all the formalism developed in this section also 


encompasses the simpler case of Abelian gauge theory. Under a U(1) gauge 
transformation, eqn (9.35) reads 


A%(x) = A,(x) — ; 8,0(x). (9.57) 


It is then clear that for any choice of linear gauge-fixing condition of (9.27) 
the response matrix M, in (9.34) or (9.36), like the special case of non- 
Abelian theory in the axial gauge just considered, will be independent of 


A,(x). The FP factor (det M,) plays no physical role and can be dropped 
from the generating functional, 


W LJ] = [raas OL f(A,)] exp [atzo aE J,A}: 


(9.58) 


9.2 Feynman rules in covariant gauges 


For practical calculations it is more convenient to use the covariant gauges 
where unlike the axial gauge unphysical ‘ghost fields’ are needed. We start 
with the generating functional (9.40) in the form 


W{J] = | [dA,] aplis + | atta, (9.59) 


Thus, the FP modification of the integration measure det M, ÔL f,(A,,)] can 
be exponentiated and expressed as additional terms in the action, leading toa 
new S.p. In this language, the problem of gauge field quantization is solved 
because these new factors lead to a new K operator for the prototype 
Gaussian integrand in (9.6), which will have a nonvanishing determinant and 
possess an inverse. 


Faddeev—Popov ghosts 
It is straightforward to write det M, in an exponential form, 

det M, = exp{tr(In M,)}. (9.60) 
If we further write 


M,=1+L, (9.61) 
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then 


l l 
exp{tr(in M,)! = expyt L+ zt! L? +... tr L" +.. : 
n 


1 
= epi [astub x) + J [ats d*yLap(x, y)Lyaly, X) +.. | 
(9.62) 


which is represented diagramatically in Fig. 9.2. 


L b L 
b c 
C) + L L+ & + ees 
L L 
a a a 


FIG. 9.2. Diagrammatical representation of the Faddeev—Popov determinant. 


This series may be viewed as arising from loops generated by a fictitious 
isotriplet of the complex scalar fields ¢(x). Their presence and interactions 
can be described by the generating functional 


det M; ~ [tee [de"] exp fi [ats dy Y cg) LM (x, mwas) 
a,b 
(9.63) 


Because the Gaussian integral is proportional to det M, as in the case for the 
Grassmann number, rather than (det M,)~', we see that the scalar fields e(x) 
must obey Fermi statistics (recall the discussion in §1.3, especially eqns 
(1.139)}-(1.141)). They are referred to as Faddeev—Popov ghost fields. 


Gauge-fixing terms 


We next attempt to convert the delta function 6[ f,(A,,)] into an exponential 
factor. This can be accomplished by first generalizing the gauge-fixing 
condition f,(A,,) = 0 to 


fal Ay] = B,(x) (9.64) 


where B(x) is an arbitrary function of space and time, independent of the 
gauge field. The definition (9.32) of A, is correspondingly generalized 


[raon A, [Ay] OL f(An) — By(x)] = 1. (9.65) 


Clearly this definition yields the same A, as in (9.32). And we can extract the 
infinite-orbit volume factor as before and prescribe a generating functional as 


WLS] = [a4,q1aB ae M p) OL falA,) — Bal 


x xpi [ats] 209 — J’-A,— z |b (9.66) 
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where we have inserted a constant 
constant ~ [ra exP| — 3 faset 


where č is some arbitrary constant coefficient, the gauge parameter. The 
generating functional of (9.66) differs from that of (9.40) by an immaterial 
normalization factor. We can then use the delta functional to perform the 
integration over [dB(x)]. Also, substituting in (9.63), we have 


W{J] = [aA,ataeatae" exp{iSer[J]} 


with 
Serl J] = SLI] + Sor + SFera (9.67) 
where the additional terms are the gauge-fixing term 
l 
Sor = -z [acs fal A,(x)]}? (9.68) 


and the FP ghost term 
SFPG = [ats d*y Yich(x)LM (x, Y) lac). (9.69) 
a,b 


Covariant gauges in symmetric gauge theories 


Here we shall make a specific choice for the condition in (9.27), i.e. (9.64) (the 
covariant, or Lorentz, gauges) 


SAA,) = 0A = 0 os 12.3: (9.70) 
Under the infinitesimal gauge transformation 
U (O(x)) = 1 + i0(x)-t/2 + O(87) (9.71) 
A(x) = Aa(x) + 6°6°(x)A,(x) — - 6,,0%(x), (9.72) 
we have 
JAR = SA) + ô emo (x) Ai (x) — - 0,09 | 
= f(A,) + [sma yan (y) (9.73) 
with 
[M (x, y) lap = -7 O“[6" ð, — ge™ AS] ôt (x — y). (9.74) 


From (9.70) and (9.74) we can calculate the extra terms (9.68) and (9.69) in 


260 Quantum gauge theories 9.2 


the effective action S., 


Sis -5. | d4x("A,)? (9.75) 
Sreo = - | dtx Y ch(x) [5a Oy — Jane ATC). (9.76) 
a,b 


Introducing the source functions n}, n, for the ghost fields c, and c}, we can 
write the generating functional 


l 
WLI, n, n] = fraa, de de") exp Ja 209 -z (0" AS)? 
ag Ch O"LOap On = JEqoeAylCy 
+ STAM tte 4 tet | (9.77) 


where we have redefined ¢ and c’ to absorb the 1/g factor in M,. 


(A) Perturbation expansion in covariant gauges. To do the perturbation 
expansion first for a pure Yang—Mills theory, we decompose Ser = So + Sı 
where the free action is quadratic in the fields, 


l I 
STe | CE CGA — ô, A} — (3! At)? 
+ ch Oc, + JEA 4 tet + rte" | (9.78) 


and the remainder is the interaction term 
SLA, c, ec] = | d*x[ —3(0,A% = 0,A%) ge AA” 


Ta age ete AL AS AHAS” 


— igc™ Ota ASe]. (9.79) 
The generating functional can then be written 
WIJ, n, n] = exp4iS EDG a2 WSEJIW? Ln, n] (9.80) 
IN | idd,,” in isnt) 4 vee 


with 
l 
wW9{J] = [rea exp [sl -3 (0,4% — 0,42) 


l 


u a2 a jau 
5g (6 Aty + JaA {| 


W°Tn, n] = [rae exp =i | d*x[c" 87c" 


= nc" _ reti} (9.81) 
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(B) Propagators. To calculate the propagator for the A, field, we rewrite 


W°? as 
l 
WSJ] = | [dA,] exp4i [as AS( g" 0? 
— |] 
= z 0" 0”) bqyAy + sam |l 
l 
= fraa, exp4i [ats]; ASKA} + sar | (9.82) 
with 


l 
ks- [æe-(1-Yre]aa 


which possesses an inverse and we can use (9.6) to integrate over [dA, ], 


WIJ] = exp = [ats d* y TeX) GRx — DEE (9.83) 


v dtk —ik(x-y v k*k* 
G(x — y) = Ô ab p e k(; IEG oa j2 ) 


where 


: k*k* l 
ko Ik Ane 
It is easy to check that 
[aska — V)GY(y — 2) = gi ô; ô*(x — 2). (9.84) 


Similarly, we find 


Wen, n'] = exp} =l [ats dyn (x)G(x — yt} (9.85) 


where 
dtk ares) 
G _ o B nas , 
(~~ ¥) (Qn)* k +ie ” 
Thus we have the Feynman rules. 


(1) Vector boson propagator 


re : y k K; ] a b 
IAk) = Bui o" — (1 = Č) 2 le Ac i per 
(11) FP ghost propagator 
i a ` eocvccedoccce 
iA (k) = — iÔ ab ie aie. a 


A ghost field line, like that for a fermion, has directions. Thus a ghost 
is distinct from its antiparticle. 
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(C) Gauge field couplings. For non-Abelian theories there are self- 
couplings among the gauge fields, with polarization vectors é"(k) 


et(k i)e” (kajel k PSCk, k2, ka) (9.86) 
and 
e” (k; Je” (kz)e? (k3)? (ka rR (ki, k2, k3, k4). (9.87) 


The Feynman rules for the vertices (T's) follow from S; in eqn (9.79). One can 
work them out by a straightforward application of the procedure outlined in 
Chapter 1 (for the Ad* theory). But they can just as easily be deduced from 
their symmetry properties under the interchange of gauge fields. Such a 
derivation also helps us to remember their structure. In momentum space the 
first term of (9.79) has the form 


F S a 
z4 (k 1)A”(k2)A* (k3) ialki, k2, Ks) (9.88) 


where the As are the Fourier transform of the gauge fields, and [ave is the 
Feynman rule vertex of (9.86) which must be totally symmetric under the 


interchange of As. The SU(2) structure is already fixed, 
ros (ky ’ kz ’ k3) = eT alk, ) kz, k3). (9.89) 


The Lorentz structure can then be deduced. It is clear from (9.79) that 
Favalki, k2, k3) is made up of terms like k,,g,,. The precise combination can 
be worked out from the condition that T’,,,(k,,k2,k3) must be antisym- 
metric with respect to index interchanges: uv, 1 «2, etc. since &™ is 
totally antisymmetric. In this way we find 


ky .p,4 k3,A,c 
(iii) ives = ige™ [(k; tS k2)i9 wv 
+ (ka — k3)uGva + (k3 — Ki) Gua] 
with 
kı +k,+k3=0. ko. v,b 


Similarly for the quartic gauge-field self-coupling in (9.79) we have the vertex 
(iv) Iie Gage ogy) Ree “eee 

+ EEG Dip — Dav Iup) 

+ aa cate (0 Fe? Pe Bi IpaGnv) 1 

. k 3 > b 

with oe oe 

kı +k,+k,+k,=0. 
For the covariant gauge vertex which couples the ghost fields to a gauge field 
with polarization vector "(kų + k,) we have a b 


(v) ir% = ge fc, kd, alk, 


Z 
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Note the asymmetric appearance of this vertex. One should preserve a 
consistent convention of entering the momentum of either the left or the 
right ghost line at every vertex. The ghost only enters in closed loops. 
Topologically for every diagram with a gauge-field closed loop there is 
one with a ghost loop in the same place. Most importantly, like the case 
of a fermion loop, we must insert an extra minus sign for every ghost 
loop. 


We note that only the propagator depends on the gauge parameter € and 
we can make suitable choices of € for specific purposes. Within this class of 
covariant gauges, the choice of € = 1 is called the °t Hooft—Feynman gauge; 
€ = 01s the Landau gauge. 


(D) Fermions. It is straightforward to add fermions to the pure Yang- 
Mills theory considered above: we merely insert in the Lagrangian all the 
possible gauge-invariant terms that have dimension less than four 


L, = wiy"D, — mw (9.90) 
where 


DW = 0, — ig T" AGY. 
T° is the representation matrix. For example, if y is an SU(2) doublet, 


T°? = t*/2. We then have the additional Feynman rules involving fermions 
(with group indices n, m,...). 


(vi) Fermion propagator 


1Anm(k) a Onm —— a - = 
v:k—m+ie 


(vii) Fermion gauge boson vertex 


i, = 19(T amy" ia 


R: gauges in spontaneously broken gauge theories 


Finally we come to the covariant-gauge Feynman rules for gauge theories 
with spontaneous symmetry breakdown. First, let us consider the case of 
Abelian symmetry. Recalling the discussion of §8.3, it is desirable to 
eliminate the mixing terms of eqn (8.89) 


gu" 8,3 (9.91) 


where @, is the would-be-Goldstone boson field. There, we choose a gauge, 
the unitary gauge, so that @, is absent from the theory. Thus, ¢@, can be 
identified with the phase of the complex scalar field and this fictitious degree 
of freedom can be eliminated by a gauge transformation. The advantage 
of the unitary gauge is that the particle content of the theory is manifest; all 
we have in the theory are the physical states of the real Higgs particle and the 
massive gauge boson, which has the propagator normally expected for a 
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massive vector field (with mass M = gv) 


| —i(Guy — kyky/M”) 
iA, ,(k) = ELM cae (9.92) 
We have already mentioned in Chapter 2 that a theory with such a spin-1 
propagator seems to be unrenormalizable from the power-counting 
arguments—unless there are hidden cancellations among Green’s functions. 
However, from the observation that the original Lagrangian before spon- 
taneous symmetry breaking is renormalizable by power counting, °t Hooft 
(1971a,b) proved that the theory remains renormalizable even after the 
symmetry breakdown. The key is to choose another set of gauges, the 
renormalizable gauges, in which the theory has good high-energy behaviour. 
The point is that in the unitary gauge, although the particle content is simple, 
renormalizability is not transparent, as the finite S-matrix only results from 
cancellations among divergent Green’s functions. But the theory should be 
equivalent to that in the renormalizable gauge, where we obtain propagators 
with mild high-energy behaviour at the expense of introducing fictitious 
particles (the would-be-Goldstone bosons). Thus in the renormalizable 
gauge, unitarity is not manifest and we have to check that the spurious 
degrees of freedom do cancel in the physical amplitude. Such theories have 
been described as being ‘cryptorenormalizable’. 

A general class of renormalizable gauges may be represented by choosing 
the gauge-fixing condition (9.64) as 


J (Au $) = (A, + CMO) = 0 (9.93) 
where € is an arbitrary parameter. Then the gauge-fixing Lagrange density 
(9.68) is generalized to 

l 


Sf of = -z (0A, + Mo) (9.94) 


which is added to the original Lagrangian. In this way the mixing term (9.91) 
is eliminated without transforming away the @, field. Not displaying the FP 
ghost part, the free Lagrange density is given below where the ¢/, and ġ; are 
the shifted scalar fields eqn (8.87), 


1 l 
Ly = 5 (0G)? — WGP] + 5 Oo — EMG 7] 
| 4-04) +) yta ae Hay 9.95 
aes 4 uty Vy a T 2 u _ zl a) : ( . ) 


This yields 
(viii) Higgs scalar propagator 


O ET 


$$-—>-— 
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(ix) Would-be-Goldstone boson propagator 
i 


AO = a EM F ie 


> a cece 
(x) Gauge boson propagator in the R; gauge 


| = k Ky 
Awt) = + ie a ee E 


| Suv = uk M? kk, M? | 
k?—~M7+ie k =M +ie 


The interaction Lagrangian is given by 
/ / / / l / / 
Lı = GAO"? 2 — p201) + zI AA" + oY 


A 
= (67 + 62)? +gwA"A, p, — wolo + 67) (9.96) 


which yields the vertices 


(xi) @,@,A-vertex Ay 
&(p\+P2), 
ANS 
Pá P] PN 
$i $5 
AA -vertex Au 
2ig V9 uy 
N 
Na 
Ay $i 
AAgo-vertex A Ay 
„Aig guv 
L 
S N 
7 ‘N 
($5) $i ($5) 
odd-vertices $i $i 
| 
| | 
A-2idv A-6idv 
7 N / N 
yo mK 7 ~ 
A X 1 RA > 
P2 $: $i $i 
pppo-vertices. 4; $i $i ($4) $i ($3) 
N Z N / 
N / \ Z 
N S N / 
Sia 6 iA 
aa gers 


7 \ / N 


7 7 N 
$: $ $i ($2) hi ($3) 
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This class of gauges, called the R--gauge (Fujikawa, Lee, and Sanda 1972; 
Yao 1973), is characterized by a gauge parameter €. For any finite values of ¢ 
the vector boson propagator has the asymptotic behaviour 


A,y(k) > Ok?) as k>. 


Hence, from the fact that the coupling constants in the interaction 
Lagrangian have dimension 0 or 1 in units of mass, the theory is 
renormalizable by power counting. For example the particular choice € = 1 
leads to the propagator form 

. — "Wray 

Ani k? — M? + ie 

On the other hand in the limit č —> oo, the propagator for the ġ3-field (ix) 
vanishes and the would-be-Goldstone boson decouples. The vector meson 
propagator (x) reduces to the standard form for a massive spin-1 particle 
(9.92). Thus we recover the unitary gauge where there are no unphysical 
fields. 

For any finite values of € we have the unphysical singularities in the 3- 
propagator (ix) at k? = ëM? and in the gauge boson propagator (x) also at 
k? = EM”. In order to preserve unitarity, these unphysical poles must cancel 
in the S-matrix element involving only physical particles: A, and @. This is 
indeed the case. We can illustrate this with the following example (Fujikawa 
et al. 1972). For the process 


pilki) + i(k) > A(k3) + Alka), 


among the tree-level diagrams, we have those in Fig. 9.3 (as well as those with 
the final A-lines crossed). Fig. 9.3(a) due to A-exchange is given by 


. i | Guv — IuGv/M* ,/ M? 
iT = eM(k3)(2ig’v)*e kay —i)| Sy a Pee 


At the unphysical pole q? = €M?, it may be written as 
(A) —4ig’ 7 v 
1T pole = q? — EM? [ki ae" (k3) lL k2,8"(k4)I. (9.97) 


But the diagram in Fig. 9.3(b) due to @5-exchange contributes as 
i 


iT? = e"(k3)g(k, + Que (Ka)g(K2 — q), q? — EM? 
4ig? i 
= q2 — EM? — EM? [ki ,6"(k3) [kaye (k4)] 
Ap(k3) A,(k4) A (k3) od) 
a ae oe A 
Em 
/ \ 
A ` 7 X 
A k3) Ii ) AT ) 
(k,) ( 
O a ar ae oe B 
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which will cancel the unphysical pole due to A-exchange in (9.97). This type 
of cancellation of unphysical poles is very general and can be proven to all 
orders in perturbation theory by using the generalized Ward identities which 
are a consequence of the gauge invariance of the theory. In practice a good 
check on the gauge invariance of a calculation of some physical S-matrix 
element is the disappearance of the arbitrary €-parameter in the final result. 
An example of such an R; gauge calculation will be given in §13.3. 

The generalization to non-Abelian symmetries is straightforward. For 
example, the corresponding R; gauge-fixing term in the Lagrangian for the 
SU(2) example with complex doublet @ will be 


l , ae aT 5 
Lp = -zz | OPA, — 196 (P05 9 —@ 5 <P?o i (9.98) 


2¢ 
The R,-gauge Feynman rule for the SU(2) x U(1) standard theory of the 
electroweak interaction is given in Appendix B. ` 


9.3 The Slavnov-—Taylor identities 


Having outlined the quantization procedure and Feynman rules for gauge 
theories in §§9.1 and 9.2, we can proceed to make perturbative calculations. 
The regularization and renormalization procedure reviewed in Chapter 2 can 
be applied. From the Feynman rules of §9.2 we see that all couplings are 
dimensionless and that the high-energy behaviour of the propagators is such 
that the theory should be renormalizable by power counting. The divergent 
higher-order diagrams can be regularized by the dimension (d) continuation 
scheme. We can thus identify the appropriate counterterms to be inserted in 
the Lagrangian. They are of the same form as those in the original 
Lagrangian but are multiplied by coefficients which diverge in the limit 
e = (4 — d) > 0. After the addition of these counterterms, the resultant 
Lagrangian will generate, to all orders, Green’s functions that are finite when 
€ + 0. In practice, it is a very complicated programme. The resurgence of 
field theoretical studies of particle interactions in recent years was to a large 
extent brought about by ’t Hooft’s proof (1971a,b) that non-Abelian gauge 
theory is renormalizable, and that renormalizability is not spoiled even if the 
gauge symmetry is spontaneously broken. A detailed discussion of gauge 
theory renormalization (Lee and Zinn-Justin 1972, 1973) is beyond the scope 
of this book. Here we only study the generalized Ward identities of the 
Yang—Mills theory, sometimes referred to in the literature as the Slavnov 
(1972) and Taylor (1971) identities, which play an important role in the 
renormalization programme. 

The Ward identities are relations among different Green’s functions. They 
reflect the theory’s (nontrivial) symmetry (here the gauge invariance of the 
original action). These relations are important to the renormalization 
programme as they restrict the number of independent ultraviolet diver- 
gences to ensure that gauge-noninvariant counterterms are absent. Recall 
that in the simple Abelian gauge theory of QED we have, because of the 
Ward identities, the equality Z, = Z, which ensures that if two particles have 
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the same bare charges then they will also have the same renormalized 
charges. In non-Abelian theory we have many more renormalization 
constants and many more such equalities are required to ensure that the 
renormalized Lagrangian is still gauge invariant. Furthermore, as we shall 
presently illustrate, Ward identities ensure that all the unphysical singular- 
ities are cancelled in the physical amplitudes. The formal derivation of the 
generalized Ward identities for non-Abelian gauge theory will be given at the 
end of the section. 


The Ward identities and unitarity 


Consider a simple SU(2) gauge theory with fermions (f) in a doublet 
representation. The requirement that the S-matrix must be unitary, 


SSt=StS=1 or Y SaS = 64, 


implies that the scattering amplitude T p, which is related to S,, by 
Sab = Ô ab + i(27)* Ôt(pa — Pv) Ta; (9.99) 


will satisfy the relation 
1 
Im i = 5 > Tae Të (27) O*(p, 7 Pe). (9.100) 


In other words, the requirement that the S-matrix must be unitary implies 
that the imaginary part of the scattering amplitude T, is directly related to a 
sum over products of matrix elements connecting the initial and final states 
to all physical states with the same energy-momentum as the initial and final 
states. For our calculation we shall consider the fermion and anti-fermion 
scattering amplitude T (ff — ff) with the intermediate states being the two 
gauge boson states (see, for example, Feynman 1977; Aitchison and Hey 
1982). This is represented schematically in Fig. 9.4. 


(b) 
Fic. 9.4. The unitarity condition relates (a) the absorptive part of the ff > f f amplitude to (b) 
the sum of the squared amplitude for ff > AA in the physical region of two gauge bosons. 


The imaginary part of the scattering amplitude on the left-hand side of eqn 
(9.100) can be calculated by replacing the propagators in the intermediate 
states by their imaginary parts and multiplying them by the on-shell 
scattering amplitudes T(ff ~ AA) and T*(AA > ff) (Cutkosky 1960). 

For the vector boson propagator (1) of §9.2 we take the ’t Hooft-Feynman 
gauge with the gauge parameter ¢ = 1, 


A = 6(—g,,)/(k? + ie). (9.101) 
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It has the imaginary part 
57°, 5(k7)0(w) (9.102) 


where œ = |k|. Similarly the imaginary part of the ghost propagator (ii) of 
§9.2 is 


n 5” 5(k?)0(w). (9.103) 


The step functions in (9.102) and (9.103) have the effect of constraining the 
intermediate gauge particle states and ghost states to the same physical 
region. The unitarity condition for the fourth-order amplitude then reads 


[aor ge gt g” — SHS] 


si | dp, T® T®* P (k) P (k3) (9.104) 


uv n'v 


where 74° and S% are the ff > 44A? and ff > cc? amplitudes where 43 
and c’ are the gauge and ghost fields, respectively. The dp, integration is over 
the two (massless)-particle phase space. The P,,s are polarization sums of 
gauge particles 


PY ky) = J, et(ky, ojei (ki, 0) 


o=1,2 


P™ (k2) = ez(k2, 083 (k2, 0) (9.105) 


o=1,2 


where e{(k,, c) and ¢3(k,, 0) are polarization four-vectors of the two gauge 
particles with momenta k, and k, respectively. 

We note that in this case the left-hand side of (9.104) receives a 
contribution coming from the ghost fields while the right-hand side does not 
because ghosts are not physical states. This is the feature that makes the 
demonstration of the unitarity relation nontrivial. As we shall see, what 
ultimately allows the unitarity relation to hold is that the polarization sum 
P” in (9.105) is not just g”” and the effect of the ghost fields is just to make up 
the difference. 

We shall carry out the lowest nontrivial order calculation as in eqn 
(9.104). The imaginary part of the amplitude f f > ff of eqn (9.100) (the cut- 
diagrams of Fig. 9.5) has been written via eqns (9.102) and (9.103) as 
squares of the ff + AA amplitude (Fig. 9.6) and of the ff > c'c amplitude 
(Fig. 9.7). The factor of 1/2 on the left-hand side in (9.104) arises because 
there are nine diagrams when one squares the amplitude in Fig. 9.6, eight 
of them are just twice those of Fig. 9.5(a)-(d) and the ninth one corres- 
ponds to Fig. 9.5(e) with the closed gauge boson loop having a symmetry 
factor of 1/2. The FP ghost field c behaves like a fermion with c 4 c’; hence 
there is a minus sign and no symmetry factor in front of the SS* term. 
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T ~ 
rete Dork 


Fic.9.5. Fourth-order eMac for ff + f f where the intermediate-state particles are gauge 
particles and FP ghosts. 


ki 
P2 v.b 
FIG. 9.6. uaa, for T% where the final state is two gauge bosons. 


P| k, 


Fic. 9.7. Diagram for S® where the final state is two FP ghosts. 


The lowest-order diagrams for T% and S* are shown in Figs. 9.6 and 9.7, 
respectively 


ee q? 1 
T = — ig OP) > Vv o aoa So 
ee S 
1g“O(p2) y 7u kape > Pulpi) 


zabe (ky — RK), Guy + (ky + +h 


l 
— (2k, + Ko) Gna aes a ‘u(p1) (9.106) 
: abc l T T 
se = —ig’e a (k, +k)? v(p>) z Rims). (9.107) 
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Gauge-particle polarization. We now digress to a discussion on gauge- 
particle polarization. First, concentrate on one of the particles. The gauge 
particle being massless has only two physical polarization states, e’(k, o), 
o = 1, 2. Thus the three four-vectors k,,, €,(k, 1), €,(k, 2) do not completely 
span the four-dimensional space. We can furnish another vector n, such 
that 


yn: &(k, o) = 0, o = 1,2, (9.108) 
where ¢,(k, a) satisfies the orthogonality condition 
e(k, 1)-e(k, 2) = 0, (9.109) 


and the transverse condition 
k-e&k,o) = 0. (9.110) 


Since k? = 0 and n, cannot be proportional to k,, we must have k-n # 0. By 
the usual procedure of establishing completeness relations, these orthogo- 
nality conditions and the normalizations, s7(k, ¢) = — 1, yield the polariz- 
ation sum 


Fi = — Juv + Qix 
with 
Quy = [k nkun, + kva) — n°k,k,]/(k on)’. (9.111) 
Clearly the extra term Q,,, subtracts out the nontransverse polarization states. 
The task of checking the unitarity condition of (9.104) involves verifying that 
the FP ghost term precisely compensates for the extra projection terms in the 
polarization sum. Our calculation will be simplified if we adopt the 


convenient choice y? = 0. Then the extra terms in the two polarization sums 
of (9.105) take on the forms 


O"! (k, 11) = (kini + kent )/(ki ‘N1) (9.112a) 
and 


Q” (k2, m2) = (kzn + k202)/(k2 `na). (9.112b) 


Ward identities from lowest-order diagrams. To evaluate the right-hand 
side of (9.104) we need to study the contractions k4 T%%, etc. The first two 
terms of (9.106) do not vanish 


T (Pı — ky) +m 
7 z Vy 


ca" (ki — po) +m 
5 ao ae n Jua) 


b 


= —ig’i(p2) E z [ruon 


2 „abc 


= ge 0(p2) > Vulp) (9.113) 
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where we have used 


(Pı + m)k,u(p,) = [2p:k; — &,(p, — m)lu(pı) = (2p,k,)u(p,), 


etc. We make the parenthetical remark that, for the case of Abelian gauge 
theory (QED) in the oe gauge, we would obtain a null result in this 
contraction because ¢* = 0, yielding the familiar Ward identities: k4T,, 
= Tak} = 0. It follows caamediately that for Abelian theory the Q““ and 
Q% terms in the polarization sums will not contribute, and unitarity can be 
maintained without the presence of the FP ghosts in closed loops. The last 
term in T% of (9.106), which is present only in non-Abelian theories, when 
contracted ‘with k* yields 


—g?e"[2k, "kaga + (kı — Ko), k1y 


l 7 T? 
_(2k ee La 
(2k, + k2),k,,] (k, +k? v(p2) yu(p;) 


E a 
= — gopa) = pulps) (9.114) 
soe e a ae (9.115) 
(A, +k )? 2 
2 abc kay 
— gE ee ee * kul) (9.116) 


(ky +k)? 


Now (9.114) cancels (9.113), and the second term (9.115) vanishes because 
Pı + P2 = kı + k, and because of Dirac’s equation. The third term (9.116) 
is proportional to the ghost amplitude S® of (9.107). Thus we have 


ki TS = —iS*k,,. (9.117) 
Similarly, 
Toky = —iS”k,,. (9.118) 


From these relations we can also deduce k{T@k} = 0. These are examples of 
Ward identities for non-Abelian theories. (Their formal derivation will be 
presented later on.) 


It is then simple to check that the unitarity condition (9.104) is indeed 
satisfied as the right-hand side reads 


] 
5 [ATTS Lg + (kini + kinikini) 1] 
x [-g” + (kyny + kynd)(k2N2) +] 


l 
= | ao.(7T*a0 + [kK Tn2)(nyT*k2) 


+ (nı Tk.)(kyT*n2)\(kiny) (kan)! 
— [E(k T): (ny T*) + (T): (k T*) (kın) + 
—[(Tk2) :(T*n2) + (Tz) (T*k2)\(k2n2)*} 
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] 
=5 | oat +49 + 2SS* — 2SS* — 2SS*} 


= ; | apa(TT*ag — 2SS*} (9.119) 


where we have used eqns (9.117) and (9.118). 

To summarize, the unitary condition (9.104) relates the left-hand side, 
where we have used the covariant gauge Feynman rules of §9.2 with their 
spurious states of longitudinal polarization and FP ghosts, to the right-hand 
side where only the physical transverse polarization states appear because of 
the (axial) gauge conditions of (9.108). The spurious states of covariant 
gauge on the left-hand side do cancel among themselves and in the axial 
gauge on the right-hand side there are only physical states. In short, the FP 
ghost fields are needed in order to maintain the unitarity condition. 


The BRS transformation and the Ward identities 


In non-Abelian gauge theories with their FP ghost terms, the most efficient 
way to derive the Ward identities is through the use of the BRS (Becchi, 
Rouet, and Stora 1974) generalized gauge transformations. Again consider 
simple SU(2) theory with a set of fermions in doublet representation 


Z = —4FF! + piy"D,w — mpy (9.120) 
where 
D,Y = (0, — igALT*)y (9.121) 
F4, = 0,A% — 0,A4 + ge™ ARAS (9.122) 
with 
asl ‘and. J" = 7/2. (9.123) 
The Lagrangian is invariant under the local gauge transformation 
ow = —iT*O*%W (9.124) 
AS = QP Ae — - 0,,0°. (9.125) 


When we include the gauge-fixing term and the Faddeev—Popov ghost term 
according to (9.75) and (9.76), the effective Lagrangian density in the 
covariant gauge (9.70) becomes 


Lop = L+ Lo t+ Lra (9.126) 
with 
l 
Lop = -z (3" A9)? (9.127) 


LPG ae ic} Oa ap Ô, > JEabeAy |Cp : (9.128) 
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Instead of the complex ghost fields c and c’ it turns out to be more convenient 
to work with real Grassmann fields p and o defined by 


Ca = (Pa + i04)/ V2 
Ca = (Pa — io )//2. (9.129) 


Using the anticommutivity properties of the Grassmann fields p? = o° = 0, 
po = —op, etc. we have 


Levg = —10"p,(D,¢,) (9.130) 
with 
D,o" = 6,07 — ge oPAC. (9.131) 


Swis Not invariant under the general gauge transformation (eqns (9.124) and 
(9.125)) with an arbitrary 6%, but it is invariant under the BRS transform- 
ation 


ôA; = wD,o* (9.132a) 
ow = igal Toy (9.132b) 
dp? = —iw dV At/é (9.132c) 
ôo" = —gwe™ oa /2 (9.132d) 


where w is a space—time-independent anticommuting Grassmann variable 
and ¢ is the usual (covariant) gauge parameter. 
Sew is invariant under BRS transformations. As (9.132a) may be written as 


wo’ A‘, (9.133) 


6A’ = iô w0" — ige™ 


the BRS transformation is in fact a gauge transformation with a specific 
choice of the gauge function 


6° = —gwo* (9.134) 
Thus the original action S = {d*x¥ is unchanged under this transform- 
ation, ôS = 0. We need to show that, in eqn (9.67), 6(S,; + Sppg) = 0 also 
E (OX AS) + oD, | = z (43 0,(0A%) + 10"(6p")(D,,07) 
+ 10"p* 6(D,0"). (9.135) 
Concentrate first on the change of the covariant derivative o° 
6(D,o°) = ô( o" — geo" AS) 
= —gwe™ 0,(a°o°)/2 
+ ge*"(wD,o")0° + ge A(—gae’ata"/2). (9.136) 
Terms linear in g and in g* separately cancel 


—goe(d,0°)o° + gwe™ (ð o”): = 0 (9.137) 
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and 
g’ waa AS (e ehte 2 gcbd bea ES gobe phac) E o (9.138) 


The last combination vanishes because of the Jacobi identity. Thus 
ô(D,o°) = 0. (9.139) 
Using (9.132a) and (9.132c) in (9.135) we then have, 


5(S,r + Sreo) = | £ (0*A4) ô (@D"o") + ô" ($ 8a), | d*+x 


= | a| @ANOD,o") | dłx = 0. (9.140) 


This completes the proof that S.p is invariant under the BRS generalized 
gauge transformation (9.132). 


Derivation of the generalized Ward identities. The generalized Ward 
identities reflect the symmetry corresponding to the invariance of the 
effective action under the BRS transformation. To obtain these relations 
among Green’s functions we study the generating functional of the Green’s 
functions by introducing the sources J,,, a, B, z and y for the fields A,, p, ©, , 
and w respectively. It turns out that to obtain identities that are linear in 
derivatives with respect to the sources, it is convenient to also introduce the 
source terms K,, V, 4, and 4 for the composite operators D,6, 30 x 6, To 
and WT-o which appear in the BRS transformation (9.132). Thus the 
generating functional is of the form 


WJ, a, p, X> Lo K, V, A, A] 
- | [dA"] Edp] Edo] [dy] Edy] exp4i | d*x( Lay + Dt (9.141) 
where the source term È is given by 
E= J, AX +atp+B:o +y + yy +K: Do 
+ 4v: (o x 6) + AT -oy + YT - od. (9.142) 


Since S., is invariant under the BRS transformation, so is the generating 
functional of the Green’s functions ôW = 0, which implies 


fas [tea [dp][do][dy][dw](6z) 
x exp] reas + zo} =0 (9.143) 


where 62 is the infinitesimal change of the source term due to the BRS 
transformation 


6x = J, dA" +a-dp + B-d0 4+ x Ow + dy 


+ k,°5(D*e) + 5°36 x 6) + 15(T-o) + ôT -o)A. (9.144) 
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We shall first demonstrate that not only 6(D,6) = 0 as in (9.139); the changes 
of the composite operators all vanish 


6(a%a" — oo") 
= (—gw/2)(e ototo” + totoot — tototo" — totog) = 0 (9.145) 
and 
ôT -o)W = T- dof + T-o OW 
= T-(—£00 x o) — igw(T -oT -oy =0 (9.146) 
because 


l 
T:oT:o = T'T’o"o” = 5 (T°T® — T°T*)a%o° 
i i 
= - Ee™ Too” = ~T-(6 x 0). (9.147) 
2 2 
Thus eqn (9.143) may be written 
w [as [raa ... [dY ](J„ : D"o + «-0"A,/E — gB-6 x 6/2 


+ igyT -ow — igWT - oy) cp | dix [Zr + z1) =0 (9.148) 


or 


Ô a? ô g, Oo 
d4 Je gt __ ~% fra 
o| { eaa tT eo Bye OP av 


. Ô . 0 ue 
+ ig% sa 1g žr) W{J,..., A] =O. (9.149) 
This equation is the generalized Ward identity which relates different types 
of Green’s functions. To obtain relations for any particular set of Green’s 
functions we simply differentiate (9.149) with respect to the external source 
functions J,,, «, P, ... and set them equal to zero afterwards. This is a rather 
tedious procedure. A simpler way to get the content of (9.149) for the Green’s 
functions at hand is to use the fact that ôW = 0 also implies that Green’s 
functions are invariant under the BRS transformation (see, for example, 
Llewellyn Smith 1980). This can then be used to give relations that are 
equivalent to (9.149). For instance, 


6<0|T(A2(x)Ab(0))|0> = 0 (9.150) 
implies that 
<0|T(6AR(x)Ay(0))|0> + <T (Ai) 644(0))|0> = 0 (9.151) 
or 


0<0|T(D,,0%(x)A°(0))|0> + w<0|T(A4(x)D,o°(0))|0 = 0. (9.152) 
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Clearly, if we differentiate (9.149) with respect to J4(x) and J°(0) and set all 
source terms equal to zero, we also obtain (9.152) as the result. In practice 
this method used in (9.150) and (9.151) is quite direct. However all the 
information about gauge invariance is contained in eqn (9.149), which is 
more compact and is very useful for formal manipulation. 


Formal derivation of the Ward identity (eqn (9.117)). Finally we shall 
demonstrate that the Ward identities (9.117) and (9.118) used in proving 
unitarity in the illustrative example are contained in eqn (9.149). Consider 
the four-point function <0|7(pA,Wy)|0>. Its invariance under the BRS 
transformation yields 


COlT(OpA,)0> + <OlT(p 8A, py) 
+ CO|T(pA, py)» + <O|T(VA,W ôy) = 0. (9.153) 


But when the ws are on-shell the composite operators corresponding to dW 
and ow will not contribute because they do not have a one-particle pole. To 
see this more explicitly, 


COIT )A x2) (x3) 8Y (x4))10> 
= igo O|T (P(x1)A(x2)W(x3)T : o(x4)Y(x4)l0> (9.154) 


which has the momentum space representation shown in Fig. 9.8. Clearly it 
does not have a one-particle pole in the variable k, and will vanish when we 
put w on the mass shell by multiplying the inverse propagator (y-k, — m) 


k, k; 


FIG. 9.8. 


with k, — m. Therefore when all particles are on-shell, only those terms that 
are linear in the field will survive. Eqn (9.153) reduces to the form 


g OT(OALANTU)IOY + LOT" ôo) = 0. 
For the choice of ‘t Hooft-Feynman gauge (9.101) with the gauge parameter 
č = 1, this corresponds to 
ki Ta = —iS”k,,, 
just the Ward identity of (9.117). 
A final comment. We have illustrated the importance of Ward identities in 


checking the proper cancellation of the unphysical singularities (the longitu- 
dinal component of the gauge fields and FP ghosts) in the physical 
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amplitudes. This issue of unitarity is particularly relevant for the spon- 
taneously broken gauge theories. In a class of gauge choices for such 
theories, one encounters further unphysical particles, the would-be- 
Goldstone-bosons. One needs to check their decoupling by using Ward 
identities. Thus we must be sure that Ward identities are satisfied to all orders 
in perturbation theory. Since they are reflections of the theory’s symmetries, 
it is important that we adopt regularization procedures that respect these 
symmetries. One of the virtues of the dimensional regularization scheme is 
that it clearly preserves invariance under the BRS transformation—hence its 
consequence, the generalized Ward identities. However, as we studied in §6.2 
the validity of certain axial-vector-current Ward identities is not automatic 
even after the theory is regularized symmetrically. Thus for theories with 
fermions one must check that the theory is free of the ABJ anomaly which 
would spoil the renormalizability of the theory. 


10 Quantum chromodynamics 


HISTORICALLY the first successful application of the Yang—Mills theory was 
the unification of the weak and electromagnetic interactions (see Chapter 11). 
We choose to present first, however, the gauge theory of the strong 
interaction, quantum chromodynamics (QCD), since the basic structure of 
this theory is a somewhat simpler introduction to the subject as it does not 
involve spontaneous breaking of the gauge symmetry. 

QCD represents a remarkable synthesis of the various ideas we have 
developed about hadronic physics: quarks, partons, colour, current algebra, 
etc. The simple quark model was initially developed in early-1960s to account 
for the regularities observed in the hadron spectrum, with hadrons inter- 
preted as bound states of localized but essentially noninteracting quarks 
(§4.4). This view of quarks as the fundamental constituents became more 
plausible as relations abstracted from the quantum field theory of quarks, i.e. 
the algebra of quark currents and their divergences were successfully applied 
in the late 1960s (Chapter 5). It was also gradually realized that the above 
picture needed to be augmented with quarks having a hidden three-valued 
quantum number called colour. Then came a series of important experi- 
mental measurements, starting with the ones performed by the SLAC-—MIT 
group at the end of the decade, on deep inelastic lepton—nucleon scatterings. 
The cross-sections were revealed to satisfy Bjorken scaling which could be 
successfully interpreted by Feynman’s parton model (Chapter 7). The 
significance of scaling and the parton model picture is that although the 
hadron constituents (quarks) are not produced as free particles in the final 
states of deep inelastic scatterings, they behave as if they were weakly bound 
inside the target nucleon. As we shall see, all these features can be elegantly 
combined in the theory of QCD. 

The property of QCD that led directly to its discovery in 1973 as a 
candidate theory of the strong interaction is asymptotic freedom, 1.e. 
coupling strength decreases at short distances. In this chapter we shall 
concentrate mainly on the short-distance properties where perturbative QCD 
is applicable. Only in the last section will we touch upon the long-distance 
feature of quark confinement as analysed by the non-perturbative method of 
lattice gauge theory. It is a remarkable fact that here we have a theory of the 
strong interaction in which we are reasonably confident as to the correctness 
of the Lagrangian, but do not know how to deduce many of its dynamical 
implications for low energy-momentum scales: confinement, spontaneous 
breaking of chiral symmetries, and the hadron mass spectrum. It should be 
pointed out that all indications are that the assumed properties are indeed 
consistent with QCD. 
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10.1 The discovery of asymptotic freedom 


Gross and Wilczek (1973a,b) and Politzer (1973, 1974) discovered that, for 
non-Abelian gauge theories, the origin of the coupling constant is a stable 
fixed point in the deep Euclidean limit. (t Hooft (1972) also noticed that 
in Yang—Mills theories the slope of the renormalization group f-function at 
the origin is negative.) Theories having this property are referred to as being 
asymptotically free. This is remarkable as we shall show that no renor- 
malizable field theory can be asymptotically free without non-Abelian gauge 
fields (Zee 1973a; Coleman and Gross 1973). Thus in Yang—Mills theories, 
contrary to the case in all other field theories, the coupling constant 
decreases at short distances. In the familiar Abelian theory of QED, one 
has an intuitive understanding of the decrease of the effective coupling 
constant at long distance as being due to dielectric screening by the cloud 
of virtual electron—positron pairs. Thus, for non-Abelian gauge theories, 
we have to understand an anti-screening effect. As we shall discuss, the cloud 
of virtual gauge particles, which are bosons carrying (colour) charge and 
spin, makes the Yang-Mills vacuum behave like a paramagnetic substance 
and, through relativistic invariance, this implies that the vacuum anti- 
screens charges. 


Theories without Yang—Mills fields are not asymptotically free 


Let us recapitulate some of the relevant points made in Chapter 3 where the 
renormalization group was illustrated with the simple 1¢* theory 


1 2 42 A 4 
L = 5 10,6) — mg7] - 5 0+. (10.1) 


When all energy-momenta are scaled up op;, c > œ (the deep Euclidean 
region), apart from trivial dimension factors, the Green’s function depends 
on ¢t = Ino only through the effective coupling constant A(A, t), which is in 
turn governed by the renormalization group f-function (see eqns (3.104) and 
(3.105)) 

dA = 

ria f(A). (10.2) 


We are interested in the slope of the #-function at the origin 4 = 0 because, 
for small couplings, f can be calculated perturbatively and because the sign 
of the slope determines whether å = 0 is an ultraviolet or an infrared fixed 
point. If the theory is asymptotically free with 4 = 0 being a ultraviolet 
fixed point, we recover the canonical (i.e. free field theory) light cone 
singularities and the parton model with its free quarks at short distances as 
given in Chapter 7. 

The B-function can be calculated as follows. In a massless theory, the only 
scale parameter u appears in the subtraction point which is needed to define 
all the renormalized quantities. From eqns (3.32) or (3.71), the B-function in 
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this case is given by 


B(A) = MS (10.3) 


where À is the renormalized coupling constant related to the bare coupling as 
given by eqn (2.50) 


A= Z; ZGAq.- (10.4) 
Z, is the scalar wavefunction renormalization constant, 


$ =Z; ebo (10.5) 


or, equivalently, is defined in terms of the unrenormalized scalar propagator 
at the subtraction point chosen to be, for example, some Euclidean point 
pP = —p?, 
—iZ 

=—;*. (10.6) 
p? ss igi u? u 
Similarly, the vertex renormalization constant Z, can be defined through the 
unrenormalized four-point vertex function as in eqn (2.38) 


= —1A9/Z,. (10.7) 
p? = -u 
The one-loop contributions to the two-point and four-point Green’s 
functions and hence to Z, and Z, are shown in Fig. 10.1. Thus the 
renormalization constants are functions of the bare coupling 4, and the ratio 
of the cut-off A to the subtraction parameter u. 


(py , P2, P3» Pa) 


N 7 
zgr 3 
-x ` ae a `X 
: h 2’ sar N Bed ETA 
oo Se ee ee gon / x 
/ N / \ 


(a) (b) 


Fic. 10.1. Lowest-order scalar meson (a) self-energy and (b) vertex radiative correction graphs. 


To obtain the f-function as in (10.3) one merely has to calculate the 
divergent part of the Zs and differentiate with respect to the cut-off (see eqn 
(3.44)) 


B= [2 In Zy(Ao, A/w) — In Z,(40, A/y)]. (10.8) 


0 
a ee 
OlnA 
The one-loop contribution to Z, in Fig. 10.1(a) vanishes (Z, = 1); the 
only nontrivial diagrams are those in Fig. 10.1(b) (see eqn (3.47)). We obtain 
+ 3)? 


BOA) = = Ez + O(2?). (10.9) 
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As the A < 0 region is not allowed (since the Hamiltonian is unbounded from 
below), the positive slope for 2 > 0 means that the simple 1¢* theory of 
(10.1) does not exhibit free-field asymptotic behaviour at large Euclidean 
momenta. This situation actually holds for the entire class of scalar field 
theories with internal symmetries, i.e. eqn (10.1) is generalized to 


ọ > Qi 
Ap* ay AijraP iP j PrP 1 


where 4, ,,; is Symmetric in its indices. The scalar fields ġ;(x) belong to some 
(possibly reducible) representation of the symmetry group and in every term 
all the internal symmetry indices are contracted. Now we have a whole set of 
quartic couplings, satisfying equations generalized from (10.9). 


kt dt 16r? 


LAijmnAmnkt + Aitmn4mnjt + AitmnAmnjk]. (10.10) 


The theory is still not asymptotically free because one can easily find that 
there are -functions having positive slopes. For example, 


3 
Piia = TEZ Atma Amn > 0. (10.11) 


We now consider theories with scalar bosons and fermions interacting 
through the renormalizable Yukawa coupling 


L = Wir" 6, — mY + 36,6)? — mip] — A6* + pw. (10.12) 

We have two coupled renormalization group equations corresponding to the 
graphs in Figs 10.2 and 10.3 
dA 


ba = Gy = Aad’ + Bidp* + Capt (10.14) 


d 
p= + = A,p>? + B,2?p. (10.15) 


It should be noted that the lowest-order terms are not necessarily all single- 
loop diagrams. In particular, Fig. 10.3(b) is a two-loop term. However, in 
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Fic. 10.2. Lowest-order contribution to the A-coupling renormalization constant. 
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(a) (b) 
Fic. 10.3. Lowest-order contribution to the Yukawa coupling renormalization constant. 
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order to have a negative $, in eqn (10.14) with A, positive, we must have p? 
at least of the order of A, hence the 47 term of Fig. 10.3(b) may be dropped 
when compared to the p°? term of Fig. 10.3(a) which yields the result 


+1 
bo = yea 2tatat 2p >O. (10.16) 


Hence the theory (10.12) is also not asymptotically free. Again this statement 
can be generalized to the entire class of renormalizable theories with spin-0 
and spin-1/2 fields having all possible internal symmetries. Thus we have the 
interaction Lagrange density 


vA = —AijzmPiP j Pr Pm a WalAvp T iBhoYs) WebP (10.17) 


The combination p*, = A*,+iB*, satisfies the renormalization group 
equation generalized from (10.16) as 


d 1l we rare ee 
Po = i = zga Pp” + ap'p"'p' + ppp” 
+ (tr p'p)p! + (tr p” pòp], (10.18) 


repeated indices being summed over. We have used the property that, when 
a (massless) fermion propagator is moved over, the gamma matrix com- 
mutation is such that one has p > pt. From this, we get 


d a. Al ee 
gy E OP) = gra [2(tr p” ppp’) 


pees lee. aed 
+ (tr p” ppp) + 5 (tr p” ppp”) 
+ Re(tr p"p’)(tr pp’) + (tr ptr pp]. (10.19) 


The second and third terms on the right-hand side are positive definite since 
they are traces of the square of hermitian matrices (p'p""). The fourth term is 
less than the last term because 


Re[(tr p” p (tr p""p’)] < |tr(p"p’)/? = (tr p"p/)(tr pip"). (10.20) 


Hence, 
8n* (tr pip’) = 2L(tr ppp") + (tr p” Ptr p'p”)] 


= (PibPia + Paahi ok + php) 20. (10.21) 


to reach the last line we have written out the trace terms explicitly and 
relabelled some of their indices. Thus all renormalizable theories with only 
spin-0 and spin-1/2 fields are not asymptotically free. Finally we have the 
familiar result that the QED f-function has a positive slope at the origin of 
coupling space, since 


e = Zi !Z, Z! e, = Ze, (10.22) 
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where we have used the Ward identity Z, = Z, and Z, is the photon 
wavefunction renormalization constant that can be calculated from the 
vacuum polarization diagram Fig. 10.4(a) 


Z4,=1- aay (10.23) 
67 


Fic. 10.4. 


Thus, just as in (10.8), we have 
o[In Z3'7(e, A/p)] 


pee din A 
+ 3 
-a a + Oe). (10.24) 


Similarly, for scalar QED with Z, receiving a contribution from the charged 
scalar meson loop diagram of Fig. 10.4(b), we obtain 


+e? 
© 482? 


B + O(e°). (10.25) 


Non-Abelian gauge theories are asymptotically free 
The general Yang—Mills theory has the Lagrange density given in §8.1 
L = —Ftr(F,,F*’) (10.26) 
F, = 0,A, — 0,A, — igLA,, Ay] 
where 
A, = T'A; 

is a matrix of hermitian vector fields with 

CT", T®] = ic” Ts 

tr(T?T°) = 4 6”. (10.27) 


To quantize the theory we must fix the gauge. In §9.2 the covariant-gauge 
Feynman rules are given, with its gauge-fixing terms and the Faddeev—Popov 
ghosts. We have in particular the gauge boson propagator in the form 


ab 


k? + 


iA®(k) = iL—g,, + (1 — Ok, k,/k7] (10.28) 
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where é is the gauge parameter. The -function can then be calculated much 
in the same manner as in all the other renormalizable theories considered 
above. If we choose to define the gauge coupling through the vector three- 
point function according to rule (iii) of §9.2 


iar (ki, k2, k3) = gC? Lgyy(ky — ka), + galka — k3), 


+ Jalka — ki). (10.29) 
Then the renormalized coupling constant is related to the bare coupling as 
g = ZÌ? Z; "90 (10.30) 

where Z, is the vector wavefuntion renormalization constant 
A, = Z4°*'*Ao, (10.31) 
or, equivalently, is defined in terms of the unrenormalized (transverse) vector 

propagator at the (Euclidean) subtraction point k? = —p? 

LIAM A) To lk? = — 2 = Z on $ Ee) ad ae (10.32) 


Similarly, the vertex renormalization constant Z, can be defined through the 
unrenormalized three-point vertex (10.29) 


ERALA 5) kz, k3)Jo = Z; 90 Loy (ks os Ka)a a Gyalky _ k3), 
+ Galk3 — kı) ] at k? = —p?. (10.33) 


One may find it helpful to compare eqns (10.30), (10.31), (10.32), and 
(10.33) to their A@* counterparts in eqns (2.40), (2.23), (2.21), and (2.38). 
The one-loop contributions to Z, and Z, are shown in Figs. 10.5 and 10.6, 
respectively. After a tedious calculation one finds 


ge (13 
Zi =e eV) nh (10.34) 
lór \ 3 


Fic. 10.6. Trilinear gauge-boson vertex correction. 
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and 
2 
go (17 3é 
Z.=1 ee VA 10.35 
where 
t,(V) OP = CMO. (10.36) 


As we have seen in §4.1 the structure constants C* themselves form the 
adjoint representation matrix 


cm = [T(V) Jac (10.37) 
and (10.36) may be written as 
t,(V) 6” = tr{T(V)T(V)}. (10.38) 


We have labelled these quantities with V, for vector, since the vector gauge 
fields A’ belong to the adjoint representation of the group. Hence (10.38) 
shows that ¢,(V) can be interpreted as the sum of the squared symmetry 
charges of the vector gauge particles. Also from the simple property of the 
SU(n) adjoint representation discussed in §4.1, we obtain by using eqns (4.21) 
and (4.134) 


t.(V)=n for SU(m). (10.39) 
From (10.34) and (10.35) we immediately obtain the famous result 


0 3 
B, = -In A ($in LA -im Z, 


3 
g? 11 


which, at this one-loop level, turns out to be independent of the gauge 
parameter č. If the gauge fields are coupled to fermions and scalar mesons 
with representation matrices T7°(F) and T°(S), respectively, then we can make 
use of results (10.24) and (10.25) directly 


B, = 2. -5 to(V) + ; ta (F) + ; 1) (10.41) 
where 
t (F) 6” = tr{T(F)T°(F)} 
and 
t,(S) 6” = tr{T*(S)T*(S)} (10.42) 


are the sums of the squared symmetry charges for the fermions and scalars, 
respectively. For fermions and scalars in the fundamental representation of 
SU(n), we have TS) = TF) = A*/2 and 


t,(F) = 1,(S) = 1/2. (10.43) 
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If one uses the two component fermion fields or real scalars the coefficient of 
t,(F) and 7,($) in (10.41) should have an additional factor of 1/2. 


The Yang—Mills vacuum as a paramagnetic medium 


One of the most remarkable features of quantum field theory is that Yang— 
Mills theories are the only asymptotically free theories in four dimensions. As 
it turns out there is a simple physical explanation of this phenomenon 
(Nielsen 1981; Hughes 1981). This explanation ultimately has to do with the 
fact that non-Abelian gauge fields have spin and obey Bose—Einstein statistics 
and, unlike the Abelian photon, they carry the gauge symmetry charges 
themselves. 

As we have mentioned at the beginning of this section, asymptotic freedom 
means that the vacuum anti-shields charges, i.e. it acts like a dielectric 
medium with dielectric constant 


e<l. (10.44) 


Also the quantum field theory vacuum differs from the ordinary polarizable 
medium on a very important point: it is relativistically invariant. This means 
that the (relative) magnetic permeability u is related to the dielectric constant 
by 


ue = | (10.45) 


so that the velocity of light is 1 in the vacuum. This allows us to translate the 
electric responses into its magnetic responses, which have two elements. 


(1) Landau diamagnetism (u < 1). The charged particles in the medium, in 
response to the external magnetic field, produce a current which itself induces 
a magnetic field opposing the original field. 

(2) Pauli paramagnetism (u > 1). If the particles have magnetic moments 
they tend to align with the external field. 


It turns out to be easier to visualize the magnetic response of the Yang—Mills 
vacuum; the anti-screening of (10.44) means 


>l. 


The Yang-Mills vacuum acts like a paramagnetic medium. We note that 
such a correspondence does not exist for ordinary polarizable material which 
can, for example, have both the properties of (e > 1) dielectric screening and 
of (u > 1) paramagnetism. 

It should be emphasized that the electromagnetic terminology is used here 
only as an analogue to ordinary U(1) gauge theory. Thus by charge we really 
mean the gauge symmetry charges. For example, in the SU(3) gauge theory 
of QCD, they are the colour charges; by electric and magnetic responses we 
mean the colour electric and magnetic responses, and so on. When we say 
that the Yang—Mills fields of QCD (gluons) carry charge, magnetic moment, 
electric quadrupole moment, etc. we mean they carry colour charge, colour 
magnetic moment, etc. (In actual fact gluons are electrically neutral.) What 
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are then the charge, magnetic moment, etc. of the Yang-Mills particle? 
Consider the simple SU(2) theory. It has a trilinear gauge field coupling given 
by the Feynman rule (iii) of §9.2. 


irse (k > ka, k3) = ie” Lg, (ky aig kK), + Ivilk2 a k3), 
+ Gulks — ky)v] (10.46) 


which can be thought of as the vertex of a charged spin-1 particle and a 
photon. For the purpose of this interpretation we assume that the SU(2) 
gauge fields 4%; have the correspondence 


A} => A, corresponds to the photon field; 


5 (A1 + A?) > Vt corresponds to the charged spin-1 fields with mass M. 
(10.47) 

Then the e*” factor gives 
e 13 = 4(e!?3 — 6713) =i. (10.48) 


We pick the momentum configuration such that (recall that all k;s are 
supposed to point into the vertex) 


ki = (m, -5) k5 = (-m. -5) k3 = (0, k) 


for M > |k|; the polarization vectors for the charged particles are 


gt = k'e e ct = k-e e’ 
1 7 IM?’ ` 2 IM’ 


so that e; +k; œ ¢,:k, ~ 0. After contracting e4, > and A? into (10.46) we 
obtain 


Ta 84654" = 2M| ge ‘Ay + =e x e’)-(k x a) + O(k?). 
(10.49) 


Thus we can identify g with the electric charge and g/M with the magnetic 
moment corresponding to a gyromagnetic ratio of 


ipo. (10.50) 


We can calculate the vacuum energy density in the presence of an external 
magnetic field 
l 
uo = — B?.. (10.51) 
2u 
From this the magnetic permeability u can be extracted. Nielsen (1981) and 
Hughes (1981) have shown that, for u= 1 + y where y is the magnetic 
susceptibility, 


] 
L~ (he ¥ (-3 + 7s) (10.52) 
S3 
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where q, y, and s, are the charge, gyromagnetic ratio, and the projection of 
spin in the direction of the external magnetic field, respectively. The two 
terms correspond to diamagnetic and paramagnetic responses, respectively. 
The factor (— 1): in front means that there is an extra minus sign for a 
fermion system. When normal-ordering the creation and annihilation 
operators in the Hamiltonian to isolate the vacuum energy term, the 
anticommutation relations of the fermion fields give rise to this extra minus 
sign. (This is the same reason that a fermion loop in Feynman graphs is 
accompanied by a minus sign.) As a simple check one sees that for fermion 
(Vr = 2) 

xr ~ —4F2(—4$ + 1) = — $q. (10.53) 


That the susceptibility is negative means that the system is diamagnetic 
(up < 1) hence has the property of dielectric screening £p > 1 as in QED. 
Also note the well-known ratio of 3 for the relative paramagnetic and 
diamagnetic contributions. Keeping in mind that the massless vector gauge 
particles have only two helicity states s = +1, we obtain, for the vector, 
fermion, and scalar particles 


X ~ Za — bak — 348. (10.54) 


To convert these to the gauge charges, the squared charge factors are 
identified with the trace terms of (10.38) and (10.42) 


qi > 3t(V) 
qt > t2(F) 
gg > t,(S). (10.55) 


We then obtain a result identical to (10.41). The factor 1/2 in (10.55) reflects 
the fact that in gauge theories the vector particles have been represented by 
hermitian fields and each complex charged field actually has two real 
components (see, for example, eqn (10.47)). 


Gauge theories with scalar mesons 


We now consider the possibility of giving all Yang-Mills vector bosons 
masses through the Higgs mechanism (as in §8.3) without destroying 
asymptotic freedom (Gross and Wilczek 1973b; Cheng, Eichten, and Li 
1974). This appears to be very difficult. The problem has to do with the 
quartic couplings of the Higgs scalars which tend to be ultraviolet unstable. 
The modification of eqn (10.10) involves adding contributions from the 
diagrams in Fig. 10.7 


l 
en [1283(S)g ijr = 3A ig] (10.56) 


2 7 
N 7 re 7 
Pa . NS Z : 
~ +3 other diagrams eK +2 other diagrams 
4 N Fá ‘N 
7 \ 7 ‘N 


7 7 
4 Xo | N 


7 N 
Fic. 10.7. 
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where 
52(S) 0, = (T"S)T"S));; 
Aina = (TS), T°(S)}i{ TS), T’(S)} x1 
+ two other terms by permutation. (10.57) 
Thus basically we have renormalization group equations of the form 


dg? 


“dt = —bog* bo > 0 
dA 2 2 4 
ET AA- + Bag’ + Cg”. (10.58) 
Introducing the variable v = A/g?, eqn (10.58) becomes 
1 dv 
re Ay? + Bv+C=B, (10.59) 
g“ at 


with B = B’ — bọ. Asymptotic freedom requires that g —> 0 and 4 —> 0, i.e. v 
approaches a fixed point in the ultraviolet limit. Since the right-hand side of 
(10.59) is a second-order polynomial, the condition for B, = 0 to have real 
roots is simply 


A = B? — 44C > 0. (10.60) 


Let us call these two roots v; and v, with v, > v,. Since the slopes at these 
two points 


d 
By = A(y, = V>) < 0 
dv |, 
and 
dp, 
dy = A(v, — vı) >0, (10.61) 


the smaller v; is a stable fixed point. But A and v, are required to remain 
positive (so that the classical potential is bound from below); this requires 


B<0 (10.62) 


because both A and C are positive and v, = (— B — ./A)/44 < —B/2A. In 
all the cases examined, these asymptotic conditions (eqns (10.60) and (10.62)) 
always imply that only a small number of scalar mesons are allowed in the 
theory, too small a number to do the job of breaking down the gauge 
symmetry completely and giving all gauge bosons nonzero masses. This 
situation is not changed even in the presence of Yukawa couplings: their 
contributions to (10.58) are generally small. (An important exception is the 
supersymmetric theory.) Thus gauge symmetry is not broken spontaneously 
in an asymptotically free theory and this suggests that one should work with 
such theories omitting elementary scalars altogether. 
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Product groups 


Up to this point we have restricted our considerations to simpe Lie groups, 
i.e. theories with only one gauge coupling constant. The more general case 
involves direct products of simple groups G, x G, x ...G,, each with its 
own coupling constants g;. To lowest-order in g? the f-functions are 
independent of each other and, therefore, the results can be deduced directly 
from those for simple groups. In particular, if one of these factors, G;, is an 
Abelian U(1) group, the associated gauge couplings will not be driven to zero 
and the theory is not asymptotically free. 


10.2 The QCD Lagrangian and the symmetries of the strong 
interaction 


The success of the quark—parton model in describing Bjorken scaling as 
observed in deep inelastic lepton—hadron scattering clearly suggests that the 
field theory of the strong interaction should be asymptotically free so that the 
quark can interact weakly at short distances. We have shown in the last 
section that only Yang-Mills theories can exhibit free-field asymptotic 
behaviour at large Euclidean momenta. 

Which symmetry of the quark model should be gauged? We have already 
seen in §4.4 how, by postulating that quarks have a hidden three-valued 
quantum number called colour, one can overcome the paradoxes of the 
simple quark model. This idea of exact colour symmetry is strengthened by 
the agreement with experimental measurements of the anomaly calculation 
of the n° > 2y rate (§6.2) and of the parton-model calculation of a(e*e™ 
— hadrons) (§7.2). Furthermore, since we also need to assume that only 
colour singlets are observable, it suggests that the forces between the 
coloured quarks must be colour-dependent. In fact a colour-independent 
strong interaction would imply the phenomenologically unacceptable result 
that every hadron should have degenerate partners having different colours. 
All this leads to the idea that it is the colour symmetry of the quark model 
that should be gauged. Thus, the strong interaction should be described by 
an SU(3) colour Yang—Mills theory with each flavour of quarks transform- 
ing as the fundamental triplet representation. This, together with our 
requirement that the strong interaction theory be renormalizable, fixes 
(almost) completely the form of the Lagrangian. The theory is called 
quantum chromodynamics (Gross and Wilczek 1973a; Weinberg 1973); 
Fritzsch, Gell-Mann, and Leutweyler 1973) with a Lagrangian usually 
written as 


nf 
Locp = —4 tr G,,G" + X qliy" D, — m,)a, (10.63) 
k 


where 
G,y = 0,A, — 0,A, mA A] 


Dds = (ô, = IgA ,,)qy, 


8 
Ap 2 A24" /2 (10.64) 
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where the A’s are the Gell-Mann matrices that satisfy the SU(3) commutation 


relations 
A, Ap aes AS 


and the normalization condition 
tr(AtA°) = 2 6”. (10.66) 


The strong interaction gauge fields Af are called gluons and the q,s are the 
quark fields with the subscript k being the flavour index k = 1, 2,..., mp (nç is 
the number of quark flavours) 


qg: u, d, s, c, b, ... (10.67) 


In (10.63) we have left out a possible SU(3)-invariant and dimension-4 
renormalizable term 


trG,,G"" with G* = 4G. 


Such a term can be written as the divergence of a current tr GG ~ "K, hence 
it contributes only as a surface term in the action. Making the usual 
assumption of fields vanishing at infinity Aj — 0, one is normally justified in 
discarding this term. As it turns out, for a class of gauge fields with nontrivial 
topological properties, this justification may not hold. Experimentally we 
know that, if the tr GG term exists in the QCD Lagrangian, it must be 
multiplied by an extremely small coefficient. For the time being we shall 
decree its absence and shall take up this whole area of instanton problems in 
Chapter 16. 

The QCD Lagrangian (10.63) clearly possesses all the well-known strong- 
interaction symmetries. It conserves charge conjugation and parity. Because 
the gluons are flavour-independent it conserves strangeness, etc, In fact eqn 
(10.63) has all the flavour symmetries of a free quark model, particularly the 
SU(3) x SU(3) chiral symmetry, broken explicitly by the quark mass term, 
as discussed in Chapter 5 (see comments at the end of §5.5). If QCD dynamics 
is such that chiral symmetry is realized in the Goldstone mode then all the 
successes of PCAC and current algebra can be accounted for. 


Gauge invariance, renormalizability, and QCD symmetries 


It is important to realize that these symmetry properties are not put in eqn 
(10.63) by hand; they are the consequences of gauge invariance and 
renormalizability. In the following we shall show that (10.63) is equivalent to 
the most general renormalizable SU(3) Yang-Mills theory of quarks and 
gluons 


Z 
GL = UA + Bys)iy"D,q + GC + Dys)\q—FtrGG (10.68) 


where A, B, C, and D are all hermitian matrices in the flavour space and Z is 
a constant. We can rewrite (10.68) in terms of the left-handed and right- 
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handed quark fields 
qi = ¿(1 — 75)q, qr = (1 + y5)q 
with the result 


GO) = q(A + B)iy"D, qi + Gr(A ~~ B)iy"D,Gr 
. . Z 


This can be transformed into eqn (10.63) by the following two steps. 
(1) We will first rescale the gluon field and gauge coupling using 
AS > ZPAS, g > Z“ iQ 
so that Z — 1 in the gluon kinetic energy term without affecting the covariant 
derivatives of the quark fields. Then introduce new q, and qr quark fields so 


that the two independent matrices A + B and A — B both become unit 
matrices. Now the Lagrangian takes on the form 


GON = quly"D, ai + Griy"D, Gr + Gi:Mqr + GrM'q, —4tr GG 
(10.70) 


where 
M=C+i1D (10.71) 
or, equivalently 
LY") = Giy"D,q + G(C + iDy5)q — 3 tr GG. (10.72) 


We have not bothered to introduce new labels for the new fields and matrices. 

(2) We now make use of an important result of the linear algebra. (A 
proof of this theorem will be presented in §11.3.) It states that a general 
matrix such as M in (10.71), which is neither diagonal nor symmetric, can 
always be diagonalized with positive eigenvalues by a bi-unitary transform- 
ation. Thus, 


SMT'=M, (10.73) 
where S and T are unitary matrices and M, is diagonal with positive elements 


tr Ma => m. (10.74) 
k 


We see immediately that this allows us to transform Y™ of (10.70) into the 
canonical form of (10.63). Thus, we can redefine the quark fields 


qı > SqL and qr > Tqr (10.75) 


so that the mass term in (10.72) is diagonalized and free of y, without at the 
same time introducing y;s into the D,q terms. The physics that makes this 
possible is that strong interactions are mediated by flavour-neutral vector 
gluons. Had it not been for the flavour-independence of the colour gluon 
fields, we would not have the matrices A, B, C, and D in (10.68) to commute 
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with the generators of the gauge group; had it not been for the spin-1 nature 
of the gluon, the symmetries of the strong interactions would be controlled 
by other terms besides the quark mass matrices. Thus this is another 
argument (besides the difficulties discussed in the last section on the Higgs 
mechanism in an asymptotically free theory) against the presence of 
elementary scalar fields in strong interaction theory. 

Two more general comments on QCD symmetries follow. 


(A) Chiral symmetries of QCD. We have already stated that, in the limit 
m, = M = mM, = 0, Locp, like that of the free-quark model, is invariant 
under the chiral unitary transformations of eqns (5.39) and (5.47). In other 
words, in the absence of the quark mass matrix, the theory is invariant under 
the unitary transformations of (10.75), and we have a U(3), x U(3)p 
symmetry. The diagonal subgroups SU(3) and U(1) are realized in the 
normal mode; i.e. the vacuum is also invariant under the U(3),,, trans- 
formations. The hadrons form degenerate SU(3) multiplets and baryon 
number is conserved. The remaining symmetries—the axial SU(3) and U(1) 
symmetries, corresponding to the U(3),_, transformations—are not 
manifest in particle degeneracies. Since we are not using any elementary 
scalar fields in the theory, we must assume that the dynamics is such that the 
QCD vacuum breaks these axial symmetries. (Whether this actually takes 
place is a difficult dynamical problem that is still not completely settled yet; 
but all indications are that this indeed takes place according to our 
expectation.) The Goldstone theorem then informs us that there should be 
approximately massless pseudoscalar mesons in the hadron spectrum. Eight 
of them can indeed be identified readily 3 ms, 4 Ks, and 1 n. However, since 
we need to break an axial U(3) symmetry, we are still one pseudo-scalar 
short. This is the famous axial U(1) problem. Namely, in the massless limit 
QCD (in fact any quark model) is invariant under the phase rotation 


q, > hq (10.76) 


where we have the same @ for all k, i.e. u, dı, Sı are multiplied by a common 
phase e™™ and up, dr, Sg by e”. This approximate symmetry is not observed 
in the strong interaction: it is not realized either in the normal or the 
Goldstone mode. The resolution of this U(1) problem will be discussed in 
Chapter 16 in connection with the instanton solutions of QCD mentioned at 
the beginning of this section. 


(B) Stability of QCD symmetries against weak radiative corrections. We 
now discuss briefly the problem of symmetry violation terms as induced by 
weak radiative corrections (Weinberg 1973b). Although the subject of gauge 
theories of electroweak interactions has not been introduced, we can still 
discuss this problem since we only need a few general properties of such 
theories. 

(1) The generators of the electroweak gauge group commute with all those 


of QCD, i.e. gluons are flavour-neutral and weak intermediate vector bosons 
(W-bosons) and currents are all colour singlets. 
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(ii) The weak interactions have an energy scale set by the W-boson masses 
My, = O(10? GeV). 

(iii) The weak gauge coupling constants are of order e and are related to 
the familiar weak Fermi constant by Gp = O(a/M4,), where « = e7/4z is the 
fine structure constant. 


Knowing that weak couplings are O(e), a priori one would fear that weak 
radiative corrections would induce unacceptably large O(a) violations of 
parity and strangeness conservations. However property (i) implies that these 
radiative corrections themselves are all invariant under SU(3)oou. In 
particular the order « additions to the Lagrangian must be dimension-4 
operators; hence (as we have demonstrated above), with suitable re- 
definitions of the gluon and quark fields, the Lagrangian can be restored to 
the canonical form of (10.63) with all its symmetries. Terms involving 
operators of dimension D > 4 will, by dimensional analysis, be multiplied by 
coefficients (My) °~®. For example, a term of the form qy,(1 — ys)qqy" 
(1 —ys;)q has D = 6; hence it must have a coefficient O(a/M%) and, by 
property (iii), of order Gr. 

Thus QCD has the attractive feature that in zeroth order it automatically 
possesses a set of global symmetries which match perfectly with the known 
strong-interaction symmetries and which are stable against weak radiative 
corrections. 


10.3 Renormalization group analysis of scaling and scaling 
violation 


For the QCD Lagrangian (10.63), the renormalization group f-function 
(eqn (10.41)) with 7,(V) = 3, t (F) = 1/2, and t,(S) = 0 takes on the value 


—1 


E (11 — n) g? = —bg? (10.77) 


p 


where n; is the number of quark flavours. As one changes the momentum 
scale p; > Ap;, the effective gauge coupling g(g, t) obeys the equation 


dg a 
ae — bg (10.78) 
with 
t=Ind. (10.79) 


One can integrate eqn (10.78) to obtain 
FE 
1 + 2bg°t 


g(t) = (10.80) 


where g = g(g, 0). Thus for n; < 17, i.e. b > 0, the denominator of (10.80) 
cannot vanish. For large momenta Ap; with 1 > œ, we have g(t) —> 0 and 
asymptotic freedom. But we should note that the effective coupling decreases 
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to zero very slowly, as a logarithm, g ~ (2b'In 4)” 1. For convenience, we can 
choose the scaling parameter / as the ratio of the momentum of interest Q to 
the subtraction scale u, i.e. 4? = Q?/p? or t = $ In Q?/u*. Then we can rewrite 
eqn (10.80) 


x(u?) 
1 + 4nba (u?) In O7/p? 
where a (Q?) = g7(t)/4n and a (u°) = g°/4n. We can further simplify this 
equation by defining the parameter A through the equation 
i 
a (4*)4nb 


a (Q?) = (10.81) 


In A? = In g? — 


to get 


4r 


(11 = 4n) In Q7/A? aa 


a (Q°) = 
In this form, the strong gauge coupling constant « (Q?) is expressed in terms 
of one single parameter A. From this we see that for small momenta, «,(Q7) 
increases and in fact it diverges at Q? = A*. Even though eqn (10.82) is a 
perturbative formula and breaks down for large couplings, the value of A is 
still a useful measure for the energy scale where the strong-interaction 
coupling constant becomes large. Hence A is the fundamental momentum 
scale of the theory and is called the QCD scale parameter. 

Since QCD is asymptotically free, at first sight one would think that this 
allows us to use the renormalization group and perturbation theory to 
calculate a large number of high-energy processes. Actually this is not the 
case: the renormalization group analysis is a theory of scale transformations 
and this involves uniform multiphcation of all components of the four- 
momenta; the ultraviolet asymptotic limit is the deep Euclidean region where 
all particles are far away from their mass shell. Fortunately, there are 
physical situations where some of the ‘external particles’ are infinitely off 
their mass shell. In the lowest-order electroweak coupling approximation, 
the semileptonic inclusive processes can be factorized into a known leptonic 
part and a hadronic quantity that corresponds to a forward scattering 
amplitude of a photon (or W-boson) with variable mass — gq’. In particular 
the cross-sections of e*e annihilation and lepton—hadron scatterings 
measure the absorptive part of the electroweak current product matrix 
elements (see eqns (7.10) and (7.152)) between some state |A» 


CA|T( J, (x) J, (OAD. (10.83) 


The high-energy and high-(— q7) limit does correspond to the deep Euclidean 
region. (For general descriptions of applications of asymptotic freedom see 
Politzer 1974 and Gross 1976.) 


ete” annihilation 


According to eqn (7.152), for the case of e*e~ annihilation, |A> of eqn 
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(10.83) is the vacuum state. Thus the photon mass q? is the only scale 
and renormalization group analysis can be applied directly. Consider the 
inverse photon propagator in QED 


(q) = (g + ¢"*q’/¢?)(q’) (10.84) 


where the vacuum polarization TI(q°) has the naive dimension 2, so that the 
relevant renormalization group equation (3.58) becomes 


0 ð 
È Sp KS J II(q7) = 0 (10.85) 
Ot g 
where y, is the anomalous dimension of the photon field, 
1 ô 
= —~~—~ [l A/w)]. 10.8 


Z, is the usual photon wavefunction (i.e. vacuum polarization) renormaliz- 
ation constant (with its one- and two-loop graphs shown in Fig. 10.8) which 


yields 
Ya = c(3 3 a) + ag T =| (10.87) 
Fic. 10.8. 


For T°(V), the representation matrices of the vector gauge fields, we have 
52(V) Ôij = (T°V)T(V));; 
= (n? — 1)/2n for SU(n). (10.88) 


For SU(3) gluons s,(V) = 4/3. e, is the electric charge of quark flavour k. We 
have not written out the precise form of the proportional constant C since it 
will be cancelled in the result that we shall quote. Solving (10.85) as in eqn 
(3.68), we have for Q? = —q?’, 


t 


I1(Q*) = Q? exp| —2 | rat )) ar | 


0) 


~ ap — ac(3 X ih: F an Miis J| (10.89) 


The ø(e*e7 > hadrons) cross-section can be obtained by taking the 
absorptive part 


R(Q?) = o(e*e” — hadrons) 2 (3 Ze) r a(o’) : | 


a(e"e” > wp) k 
(10.90) 
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Therefore the simple parton scaling result of eqn (7.104) is recovered 
together with a QCD correction term (Appelquist and Georgi 1973; Zee 
1973b) with a,(Q7) given by (10.81). Thus the ratio R(Q?) approaches 
R(Q? = œ) from above. This subasymptotic correction term, at least for the 
region above the charm threshold, is still probably less than the experi- 
mental uncertainties, so while the e*e” annihilation total cross-section is 
not an ideal place to measure «,(Q*), the overall experimental data is 
consistent with the QCD prediction of eqn (10.90). 


Inelastic IN scatterings 


According to eqn (7.10), |A> of (10.83) is the nucleon state for this case of IN 
scatterings. Hence we will be studying a physical quantity with two mass 
scales: the variable photon mass Q? = — q? — œ, but the nucleon must be on- 
shell p? = M*. One must devise methods to factorize the matrix into a 
product of momentum-independent quantities (which will be identified with 
the structure functions and the parton distribution function) and q?- 
dependent functions which scale according to the renormalization group. 
(For more explicit discussions of this factorization see §10.4.) 


(A) Operator product expansions. The technique effecting such a factoriz- 
ation is the operator-product expansion (Wilson 1969) in which the 
singularities of the operator products are expressed as a sum of nonsingular 
operators with the coefficients being singular c-number functions. The 
physical basis for this expansion is that a product of local operators at 
distances small compared to the characteristic length of the system should 
look like a local operator. 

(Al) Short-distance expansion. 


ABO) = X GE- y)OG@ + y) (10.91) 


(x— y), 790 L 


where A, B, and 0; are local operators. The @;s that can appear must have 
quantum numbers which match those of AB on the left-hand side. The C,(x)s 
are singular c-number functions called the Wilson coefficients. It has been 
proven for renormalizable theories that such expansions are valid as x > y 
to any finite order of perturbation theory. The short-distance behaviour of 
the Wilson coefficients is expected to be that obtained, up to a logarithmic 
multiplicative factor, by naive dimensional counting 
C(x) > (x) 2n xmPEl + O(xm)] (10.92) 
x«l/m 
where d,, dg, and d; are the dimensions (in units of mass) of A, B, and 0;, 
respectively. The higher the dimension of @, the less singular are the 
coefficients C(x); hence the dominant operators at a short distance are those 
with the smallest dimensions. 
The usefulness of this expansion derives from its universality—the Wilson 
coefficients are independent of the processes under consideration. Process 
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dependence is exhibited in the matrix element of the local operator 0; which is 
nonsingular at short distances. Another advantage is that in a given theory 
the expansion usually involves a rather small number of operators. Hence the 
ensuing calculation is relatively simple. 

(A2) Light-cone expansion. We already encountered this type of expan- 
sion in §7.3. Eqns (7.141) and (7.146) are examples of the generic light-cone 


expansion 
x x x: & 5 
= ——~|)~ ! t=, — ~ 10.93 
4(5)a( | coal 5 | for x ~0 (10.93) 


with singular c-number functions and regular bilocal operators 0,(x, y). 
Then one can expand the bilocal operators in a Taylor series (as in eqn 
(7.157)) to write 


x '& - 

(>. -5) = 2 K'E y „x“OR’ (0) (10.94) 
J 

so that the product of two local operators can also be expanded in terms of 

local operators on the light cone 


(5)a(-3) E S Sr dC on woz ae HOU (0). (10.95) 
x20 pi 

If we take the bases OX,’ „to be symmetric traceless tensors with j indices, 
they correspond to operators of spin j. The light-cone, x? > 0, behaviour of 
the Wilson coefficients (just as in (10.92)) can be obtained by naive 
dimensional counting 


CPx) > (fx?) 44-48 xm?) (10.96) 
x20 


where d; ; is the dimension of OX”. (0). Hence unlike the case in (10.92) the 
leading term corresponds to the lowest value of (d, ; — j), i.e. the dimension 
of OÙ? , minus the spin of OY? _,,. Such a combination is called the twist of 
an operator 


ee, ae (10.97) 


which denotes twist = dimension — spin. The operators with lowest twist 
dominate in the light-cone expansion. 

The scalar field @, the fermion field y, and the gauge field F’,, all have twist- 
one. Taking the derivative of these fields-cannot reduce the twist and at best 
leaves it unchanged because taking the derivative will increase the dimension 
by one unit while changing the spin by 1 or 0. Thus the minimum twist of an 
operator which involves m fields is m. The most important light-cone 
operators have twist-two, examples of which are listed below 


scalar: 
(10.98a) 
fermions: 

Un ae = WY, a, A öy + permutations (10.98b) 
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vector: 


OU) = Fpa Du- Duj- F + permutations. (10.98c) 
The derivatives in (10.98a,b) will be replaced by covariant derivatives D, in 
gauge theory. We have seen in §7.3 that dominance of the canonical twist-two 
operator in free-field theory leads to Bjorken scaling. 

Now we can begin to see how the deep inelastic /N scattering cross-sections 
can be factorized into two parts—one being momentum-independent and the 
other scaling in a way controlled by the renormalization group, where the 
cross-section is related to the absorptive part of the forward current—nucleon 
scattering amplitude for which one then makes a light-cone expansion; 
the maxtrix elements of the local operators will then give rise to the 
momentum-independent part and the c-number Wilson coefficients satisfy 
the renormalization group equation. We shall first show that these Wilson 
coefficients are related to the integrated moments of the /N structure 
functions. 


(B) Moments of structure functions and Wilson coefficients. In order that 
the principal festures of our manipulations not be obscured by complicated 
Lorentz structures we shall first illustrate our procedure with scalar currents 
J(x). Consider the forward scattering amplitude (Fig. 10.9) 


T(q?, v) = | dêx e7 *< p| r(4 (3).(-3)) p> (10.99) 


q q 


Fic. 10.9. 


where v = p :q/M. (Details of the kinematics may be found in §7.1.) Writing 
the operator-product expansion 


x x ie 
r( 4(5)(-3)) Ree 2, Ci(x?)x"1 . , xtiOF | (0) (10.100) 
(with the index 7 ranging over all twist-2 operators), then the amplitude in 
(10.99) becomes 


T(q°,v) ® Y, | dx eia... x Cipo? a (Olp 
ij 


0 60 0 


= 5 (i = 


i,j Ôq n, Ôq u, i Ody, 


x | fats C102) keologi? (O1P>. (10.101) 
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The matrix element of the local operator O% ’ „ which is symmetric and 
traceless can be parametrized as 


CpiOe:? Olp) = OPEPa Puz: -Puj — trace terms] (10.102) 


where OY is a constant and the trace term will contain at least one g, 
factor. Replacing 


ô ô 0 . ô N 
— = Y Ui H2 Hj 


ôq, Ôq, a Ss ôq, 
+ trace terms, (10.103) 


we obtain for large —q°, with —q°/2Mv fixed, 


T(q’, vy) z > (2i)(p: (2 3il [ats ew sci(e lof 


—q2 => %0 i J 


-y2 F) ex q3)O0’ (10.104) 


where 
Ci(q’) = (—iq°Y (ae j [ats e7 C(x?) (10.105) 


which is essentially the Fourier transform of x“!...x“iC/(x?). Note that the 
trace terms in (10.102) and (10.103) will have lower powers of (2p : q) and can 
be safely neglected in the scaling limit. Thus for the amplitude T(q?, v), 
decomposed in terms of spin projections 


T(q*, v) = 3 T;(q’, v), (10.106) 


we have from (10.104) that 


T,(q’, v) a x74 » ČOV (10.107) 
for x = —q?/2Mv. This implies that to isolate an operator of a given spin j we 


need just expand T(q?, v) in powers of x~ + for large —q*. In deep inelastic 
scattering, since one actually measures the absorptive part of the forward 
amplitude T(q’, v), 


1 
W(g?, v) = = Im T(q’,). (10.108) 


The amplitude can be reconstructed from the measured quantities by using 
the dispersion relation 


T(q’, D |e W(q’,v') + P,-1(q*, v) (10.109) 


where we have assumed s number of subtractions with P,_ ,(q*, v) being a 
polynomial in v of order s — 1 for fixed q°. If we further assume that, for 
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large —q’, we have P,_,(q*, v) > P,_,(x), a polynomial in x~' of order 
s — 1, then 
+1 
(x) W (g7, x’) dx’ 
T(q*,v) = dP 
(q ’ v) | (xA TEx men x) as S7 1(X) 
=i 
+1 
~ P+ Y x | dx'(x’)/- W(q2, x’). (10.110) 
x> J=s 


=f 


The first s terms in this expansion are undetermined because of the unknown 
subtraction constants. However for J > s (comparing eqn (10.110) to eqns 
(10.106) and (10.107)), we have 


+1 


| dxx’'W(q?, x) x ¥ CY(@’)O” for J2s. (10.111) 
— i IE 

Thus the moments of the structure functions measure the (Fourier trans- 

forms of) Wilson coefficient functions C,’(Q7). 

For the more realistic case of the electromagnetic current J > J”, we 
can make a similar analysis. With the usual assumption about the high- 
energy behaviour of the forward Compton amplitude, the relation (10.111) 
will hold for all J > 2, when the t-channel of the Compton scattering (i.e. the 
current x current channel) has the quantum number of the vacuum, and for 
J > 1 in the non-vacuum channels. When decomposed in terms of the two 
invariant eN inelastic structure functions F, ,(x, Q?) as in eqn (7.30) (we 
have changed notation from that used in §7.1: G,(x, q^) > F,(x, Q7)), the 
result corresponding to eqn (10.111) reads 


1 


l x 
| dxx” 7? F(x, Q?) z ji » C(07)0O”” (10.112a) 
1 

l Š 
| dxx! LF, Q?) ~ 5 F, CPOP (10.112b) 


=f 
We have succeeded in isolating from the cross-section, which has two mass 
scales p? = M? and q’, a factor which depends only on q*, to which we can 
apply the scale transformation q, — Aq, and renormalization group analysis. 
We note that exact Bjorken scaling (eqn (7.32)) F,(x, q*) > F,(x) cor- 
responds to free-field behaviour 


C\(Q?) > constant as Q? > œ. (10.113) 


In general we expect this to be modified by the interaction. The simplest 
possible deviation from this scaling behaviour would be such that the 
In(Q*/m’) powers in every order of the perturbation are summed up into 
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some Q* powers as 
T 1 \?s/2 
CQ?) ~ (z) (10.114) 


This can be interpreted as an anomalous dimension y, acquired by the 
operator 0%? _,, due to the interaction. As conserved quantities are finite, 
their renormalization constants do not depend on cut-off and their 
anomalous dimension vanishes. The electromagnetic currents and the energy 
momentum tensors are such quantities. If they can appear on the right-hand 
side of the operator-product expansion, then (the Fourier transforms of) 
their Wilson coefficients will scale as in free-field theory without anomalous 
dimensions. For the general moments, eqn (10.114) however implies that 
1 


YJİ2 
| F(x, Qx? dx z (=) | (10.115) 
(0) 


Q?0 \Q? 


Since the structure functions F; are positive definite, the anomalous 
dimension y; must monotonically increase with J and, since the energy— 
momentum tensor 0, „, does appear in vacuum channel of the current 
operator-product expansion, we have y, = 0 and 


y20 for J>2. (10.116) 


This means that the moments of the structure functions should in the vacuum 
channel decrease with increasing Q?. Also experimentally observed ap- 
proximate scaling implies that the anomalous dimensions must be very small. 


(C) Renormalization group equations for the Wilson coefficients. In 
Chapter 3 we introduced the renormalization group equation for a general 
Green’s function. We can obtain similar equations for the Wilson coefficients 
by comparing the renormalization group equations satisfied by Green’s 
functions containing the operator product itself and containing the local 
operators appearing in the expansion of the operator product (Christ, 
Hasslacher, and Mueller 1972). 

Schematically the operator-product expansion is of the form 


A(x)B(0) ae 2, Cx, g, HO;(0) (10.117) 


where g is the coupling constant and u is the reference (subtraction) point for 
the renormalization. Or, in terms of n-point Green’s functions with insertions 
of AB and @,, 


r x YC, g, wre (10.118) 

where i 
To} = (O|T(A(x)BO) [] PAIO (10.118a) 

k=1 


rE = COTO) [| >x(rx))10>. (10.118b) 
k=1 
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The Green’s functions separately satisfy the renormalization group equa- 
tions (see eqn (3.58)) 


È + È YG) — alg) — yao) [E u)=0 
and 
È + È yg) — a) T pu) = 0 


where 
0 0 
D = u — + P(g) — (10.119) 
a ôg 


and y4, Yg, and y; are the anomalous dimensions of the operators of A, B, and 
0;, respectively; we have assumed that they do not mix under renormaliza- 
tion. Using (10.118), clearly we have 


LD + yg) + val) — Y:(9)]C:(x, g, u) = 0. (10.120) 
Thus, the Wilson coefficient C; behaves as if it were a Green’s function of the 
operators A, B, and 0. The solution as given by eqn (3.68) takes on the form 
t 
C;(e7'xo, g, p) = eT exp} | dtl y(G(t')) + ya GC) 


= AVI} Cio g(t), u) (10.121) 


where d,, dg, and d; are the naive dimensions of A, B, and O;. Similar 
equations and solutions of course hold for the Fourier transforms of the 
Wilson coefficients Ĉ,(q?, g, u). For the case of deep inelastic scattering we 
have the light-cone expansion which can be turned into sums of infinite 
towers of local operators of increasing spin n with Wilson coefficients 
ĈČ"(q?, g, u), which are related to the moments of the structure functions by 
(10.112) 


1 


| ae 
M,(Q*) = | dxx"~*F,(x, Q*) © 8 2, CHO’, g, WO} 


0 
l S 
= z È ČKO, GO, w) exp| — | ZGO) ar lor (10.122) 
(0) 


where y$ is the anomalous dimension of the operator 0%? |, . 


(D) Deep inelastic scattering in QCD. We will now apply this analysis of 
deep inelastic scattering in QCD (Gross and Wilczek 1974; Georgi and 
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Politzer 1974), where the electromagnetic current is given by 


ny 
F(X) = S, AY, erq): (10.123) 
k=1 
where q,(x) is the quark field operator with flavour index k, e, is the charge 


carried by q,, and the sum over colour is implicit. Consider the forward 
Compton scattering amplitude (averaging over nucleon spin is understood) 


i fdt l 
Moe | Soe? pIT O, OJ OIP) (10.124) 


quady l pq pq 
= (du + Fe )r taph- a) (p -a)r (10.125) 


The absorptive parts of the invariant amplitudes 7; are just the structure 
functions W, measured in deep inelastic scattering 


1 
Wi, 20; 92) = — Im Ty, ay, q°). (10.126) 


(D1) Operator-product expansion and moments of structure functions. The 
operator-product expansion on the light cone is of the form 


ITI OIO) = X, {Ium Vua Yud CO, 9, y) 


+ Guy Jongh us: -Yu TCPO, g, YOM Ha(0). (10.127) 


In QCD there are three sets of gauge-invariant twist-2 operators (see eqns 
(10.128a, b, c) below) which dominate expansion near the light cone y? x 0 


a 


HE A 
ogma) = 5 Jao De... Digo) 


+ permutations of vector indices (10.128a) 


where the A%s are the standard n; x np hermitian traceless matrices in the 
flavour group SU(n,); thus for a theory with three flavours u, d, and s they 
are just the familiar SU(3) Gell-Mann matrices. This set of operators will 
contribute to operator-product expansions for the flavour non-singlet 
combinations of structure functions such as F? — FS or FY, for neutrino 
scattering off an isoscalar target A. Here we will devote most of our effort to 
the study of the more involved case of the flavour—singlet combinations. 
They can receive contributions from two sets of operators 


ome tee n(x) 
*n—1 
= —_- {q(x)y"tD” ...D*q(x) + permutations} (10.128b) 
Ome ee n(x) 
i? 2 


tr{G "D" ...D#-1G%" + permutations}. (10.128c) 
n! 
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Because 0” and (@ have identical quantum numbers, they can mix under 
renormalization (see §2.4). 
Substituting (10.127) into (10.124), we obtain 


o diy. 
T (p,q) = ae a IuvY n Yuz- -Yan CPO, g, p) 


T Dudar (Vi +i Vy JCP Cy? QJ; y)} 
x <plOmr---#n(0)[p> +o... (10.129) 


Writing the spin-averaged matrix element as 
< plowes:--#n(O)|p> = AL? (p"ip"2...p"" + trace terms) (10.130) 


where A” is a constant, eqn (10.129) becomes 


1 ô N" BS 
ToP) ve Ara | = dup on) [ay e CP y, g, u) 


ô 
+ PaP p D" k ) | diyen Cyg, DAP +. dea 
= l Co ) 
2M5 —~ Gry qd, g, H 
T E 
+p £ y C479, DMAP +. (10.131) 
—q 2p-q 


where 


C10’, g, u) = (Q s 2) [a yet CDV, g, y) 
C(O? g, u) = (Q?)"" (a >i |e e't ICP (y, g, u). (10.132) 
From (10.131) we can U read out the invariant amplitudes 
T, (x, 0) = — a x "ČP (Q?, g, nA” (10.133a) 
T,(x, Q*) = i x"t1E6M(Q?, g, WAP. (10.133b) 
By the same route we took going from (10.106) to (10.112), we can obtain 


relation between the moments of the structure functions and the (Fourier 
transforms of) Wilson coefficients 


1 


ier 
| dxx"™ tF, (x, Q’) © 3 CP (Q*, g, WAI” (10.134a) 
l 
1 
Lers 
| dxx"™ "F(x, Q*) x 4 2, C2(Q*, g, p)AP. (10.134b) 


0 
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(D2) Renormalization group analysis and anomalous dimension matrix for 
the singlet case. We shall concentrate on the flavour singlet case. Because of 
mixing among twist-2 operators, the renormalization group equation for the 
Wilson coefficients takes on the matrix form 


D ôn- ČP, 9, u) = 0, «= 1,2 (10.135) 
where y” is the 2 x 2 anomalous dimension matrix for the flavour singlet 
operators O and 0H. The solution to eqn (10.135) is 

t 
CP (Q/K, g) ~ > CP, A) api- [apao (10.136) 
i 0 
where 
= 7 In(Q?/p’). (10.137) 


In asymptotically free QCD one can calculate y” perturbatively according to 
the one-loop diagrams of Fig. 10.10 


y = dg? + O(g’) (10.138) 
oF oO” ow 
+ 
(a) (b) 
oF ow” ow ow) 
+ + 
(c) (d) 


Fic. 10.10. Graphs contributing to (a) »"?; (b) y@); (c) 72; and (d) yg. 
(n) (n) 
qd” = daa dög 
ij d™ q™® 
qG GG 


8 2 nq 
(i sen 5) 


I 1 a 


with 


3 n(n? — 1) 


n7+n+2 l 


= ad 1G) 


4 n 
da = Ter? fel ae eD aD 4d | “iq mf ere 
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n; is the number of quark flavours. This matrix can be diagonalized by 
taking appropriate linear combinations of OY and OP and its eigenvalues 
correspond to the anomalous dimensions of the linear combinations that are 
multiplicatively renormalizable. Using the lowest-order expression (10.80) of 
the effective coupling constant in (10.138), we have 

t t 

2d dt’ 
= fa WEIEN) = — ree 
(0) (0) 
— gin) (n) 


de — dl 
= a | 2bg7t) > J Int. 10.140 
>} n(1 + 2bg*t) J n ( ) 


The Wilson coefficient functions have the large-q° behaviour 


(n) 
2 - Q? —dji /2b 
CAE, o) ~ ZEA o| n( 2) | (10.141) 
u 
J 
where the Ĉ w (— 1, 0)s are the Wilson coefficients in free-field theory. For 
large Q?, (10.134) now reads 


1 


MPQ’) = | dxx"™ F (x, Q*/u") 


Md 
tw 


¥ CH (1, 0)AM[In(O2/p2)]- 42> (10.142a) 


~~ 
fw 


1 
MPQ’) = [axe trex Q*/") 
0 
l 
4 


P (n) 
2 CPA, OAMLn(Q?/u)]~ 447° (10.142b) 


These are the principal QCD results on deep inelastic scattering. We have 
obtained them by factoring the inclusive cross-section into a momentum- 
independent part (the local operator) and a part that scales according to the 
renormalization group (the Wilson coefficients). In the asymptotically free 
QCD the leading singular (Q? — œ) behaviour of the Wilson coefficients can 
be calculated in terms of the renormalization group f-function and the 
anomalous dimensions yf), while the matrix elements A” of the local 
operator cannot be obtained without solving the (long-distance) bound-state 
problem in QCD. While we cannot calculate the scaling functions them- 
selves, nevertheless we do have enough information on the pattern of scaling 
and scaling violation. QCD predicts that moments of the structure functions 
have a very weak dependence on q? as in eqn (10.142). So we have 
approximate Bjorken scaling with logarithmic violations. 


(D3) Momentum sum rule revisited. As an illustration we shall work out 
the flavour-singlet second moment (n = 2) of the structure function, say 
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F(x, Q”). In this case one of the spin-2 and twist-2 local operators will be the 
energy-momentum tensor 6,,. Its conservation implies no renormalization 
Zo = 1; hence we should find a vanishing anomalous dimension y, = 0. (For 
a related discussion on the renormalization of the conserved vector current 
operator Z; = 1, see §6.1.) For n = 2, eqn (10.128) indicates that there are 
two twist-2 operators 


[OP Vay aa 4 > kly 0, + Yy 04) 4k (10.143a) 
k=1 
LORIO = tr(G2G*). (10.143b) 


We are taking only the lowest-order terms; thus G 
matrix in (10.139) takes on the value 


l l6 —3n 
d? = — ry. 10.144 
4 36n? (_ 16 A l ) 


It clearly has a zero eigenvalue d,=0 corresponding to the left 
eigenvector (1, 1) = (1, 0) + (0, 1), i.e. the combination 


O= TO E OS lay (10.145) 


which is just the energy-momentum tensor. The other eigenvalue is 
dy = (16 + 3n,)/36n?. We can in general express dẹ? in terms of its 
eigenvalues d, and their projection operators P; with respect to the eigen- 
vectors 


y= 0,A, — 0,A,. The 


u 


dl) = > A Pi)ij> P,P, = On Pko 5 P, = i (10.146) 
k k 


It is easy to see that for our n = 2 case, we have the projection operators (for 
left vectors) 


l Bnp Sne l 16 — 3ng 
a eee Poo : 10.147 
Ps azh e e ali a Oea 


The second moment of the flavour singlet structure function can be written 
as 


i,j 


1 

1 s 
| ixrac, Q’) = 8 SERA, O(P) AP 
0 


+ CPA, OPa) AP [n(Q?/p?)]- %2}. (10.148) 


JoJ 


Since d, > 0, we have for Q? > œ 
1 


| dx F(x, Q?) = D {CP(1, O(P) AP). (10.149) 


i, j 
0 
To calculate the right-hand side we note that, to lowest-order in g(t), only the 


quarks contribute to the (current) operator-product expansion; hence the 
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free-field values of the Wilson coefficients are 

CH6(1, 0) = 0, CP(1, 0) = <e> (10.150) 
where 


l 
<e> = —) e (10.151) 
Ne k 


is the average quark charge squared. Thus the right-hand side involves only 
È, (P9),j;4\”’. To evaluate this we first note 


(2) 
(1, 0) l 3n, 3ne\ (O 
16 + 3n;\16 16/0% 


3N¢ 
= (0% + OY) = ré 10.152 
AT i cet ime) 
where @ is the energy-momentum tensor operator (10.145) and where 
3n¢ 
na ee 10. 
"16 + 3m eae 


can be interpreted as the fraction of momentum carried by quarks. For three 
flavours of quarks, r = 9/25 which is in accord with the experimental results 
when interpreted in the quark—parton model (momentum sum rule) of §7.2. 
Unlike in the general case, the matrix element of 6,, is known 


(plô alp? = PuPy —GyyP- (10.154) 


When eqn (10.154) is compared with (10.128) and (10.130), we have 
(Puy AP = 8r; in this way eqn (10.149) becomes 


1 


| dxF,(x, P) | = reĝ). (10.155) 
. singlet 


This means that the area under the structure-function curve scales without 
deviation. 


(D4) Pattern of scaling violation. For general n, the diagonalized ds 
have the property of increasing slowly with n. For n > 2, where the large x 
(~ 1) region is important, the moment M”(Q7) decreases as [In(Q7)]~'9""7°; 
for n < 2, where the small-x (~0) region is important, the moment M$”(Q7) 
increases as [In(Q7)]*'"’?". This implies the following pattern of scaling 
violation; as Q? — oo, the large-x part of F,(x, Q?) decreases while the small- 
x part increases, while the area under the curve remains unchanged 
throughout (see Fig. 10.11). In the infinite Q? limit, the structure function 
approaches a sharp spike at x = 0. 

In concluding this section we will briefly discuss a type of experimental 
check on QCD predictions that has been performed. To this end we will 
consider the simpler case of a non-singlet combination of structure functions 
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F,(x,Q*) 


x 


Fic. 10.11. The pattern of scaling violation of F,(x, Q7) for Q3 > Qt. 


where there is no complication due to operator mixing. Then the result 
corresponding to (10.142) reads 


[M$(Q7)Jnon-singier = BO [In(Q?/p?)] 0°17? (10.156) 


where B™® is an unpredicted constant and d™ is the g% coefficient of the 
anomalous dimension for the operator in (10.128), 


l 2 rd 
g™ —-— |1 — AN S 10.157 
6n? | n(n + 1) i 2 | l ) 
which is the same as d‘? as one would expect. Now we take logarithms of the 
moments 
— qin) 
In M™(Q°) = In In(Q?/u*) +... 


2b 
(n’) 


In M"(Q?) = 


In In(Q?/u*) +... (10.158) 


The dots on the right-hand sides of (10.158) represent non-leading terms 
(which could still be significant at the present energy level). 

If we plot these two logarithms against each other, we should get a straight 
line with slope d”/d™. Such experimental plots have been found to agree 
with the QCD prediction within limits of experimental error, see, for 
example, Bosetti et al. (1978) and de Groot et al. (1979). 


10.4 The parton model and perturbative OCD 


In this section we present a brief introduction to the study of perturbative 
QCD. We shall concentrate again on the prototype processes of deep 
inelastic lepton—hadron scattering. First we present the parton-model proba- 
bilistic interpretation of the QCD result obtained in the previous section. 
We then show in what ways the same result can be recovered by summing 
the leading logarithms in perturbation theory. 

Let us first recapitulate the QCD result for deep inelastic IN scatterings 
obtained by using the operator-product expansion and the renormalization 
group equation. For the nth moment of the flavour non-singlet combinations 
of the structure functions F,(x, Q”), the result (10.156) may be written 

x(t) 


M(t) = M™(0) RA < (10.159) 
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where ¢ = +1n Q7/Q3, a(t) is the effective strong gauge-coupling constant 
squared (10.82) and b and d™ are the leading coefficients of the renormaliza- 
tion group f-function (10.77) and anomalous dimension (10.157), respect- 
ively. Thus QCD predicts ‘almost scaling’ with a logarithmical violation. 
This pattern is illustrated in Fig. 10.11. 


The parton picture of scaling violations 


How are we to understand this QCD pattern of scaling violation in the 
parton model or, for that matter, patterns of scaling violation in general? As 
we have discussed in Chapter 7, strict Bjorken scaling can be obtained if we 
are allowed to make an impulse approximation and the reaction may be 
viewed as an incoherent sum of scatterings off the (free) constituent of the 
target. Thus this picture is applicable to any weakly bound system, and can 
be viewed as a process in which the virtual photon probes the structure of the 
target. From the uncertainty principle a virtual photon with mass (Q7)!/2 
will resolve structure on the length scale of (Q?) 12. When we increase the 
virtual photon mass, then the structure at shorter distances will be revealed. 
Typically this picture of matter as ‘a box within a box, etc.’ (i.e. discrete 
levels) will lead to scaling violations at particular intervals of Q? (when the 
next level is reached). Thus we would generally observe scaling and rescaling 
as layers of matter are unravelled. At these scaling violation junctures, the 
structure function will change with Q? and get redistributed towards the 
region corresponding to smaller values of the scaling variable x, as each 
constituent will carry a smaller fraction of the target momentum. 

In such a parton picture, the QCD result can be interpreted in terms of 
hadrons having a continuous set of constituent layers. At any particular Q? 
the shape of the nucleon structure function can be understood schematically 
as in Figs. 7.4-7.6. As we increase Q? and penetrate deeper into the dressed 
quark we will find more virtual quarks and gluons and the valence quarks 
will have to share the original nucleon momentum more and more with the 
gluons and sea quarks. Ultimately the structure function approaches a delta 
function at x = 0 


F(x, 02) = ôo. (10.160) 
Q?-+00 


The Altarelli—Parisi equation 


Thus, according to the above picture, the QCD scaling violation comes from 
the fact that the effective strong coupling «,(t) does not vanish fast enough as 
Q? — oo. Even though the theory is asymptotically free there are still some 
residual interactions at short distances. All this can be cast in well-defined 
quantitative expressions as first suggested by Altarelli and Parisi (1977). For 
a simple presentation see (Close 1979). 

At t = $1n Q7/Q%, let the probability of a quark carrying the fraction x of 
nucleon momentum by q(x). As we increase the virtual photon mass t to 
t + ôt an additional probability 6q(x) may be revealed corresponding to the 
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(a) (b) 
Fic. 10.12. 


possibility of another quark with momentum fraction y > x radiating away a 
gluon and reducing its momentum from y to x (Fig. 10.12(a)). Thus we can 
think of this quark as being contained in the one originally observed at a 
lower value of t. This suggests that the quark distribution is actually t- 
dependent and that we can define a quantity P,,(z) corresponding to the 
variation of the probability distribution (per unit ¢) of finding a quark ina 
quark with z = x/y fraction of its momentum (y). This variation is clearly an 
order-g’ effect; consequently we have the following evolution equation 


d 
ne = g*(t) |e |e O(x — yz) Pa(z)q(y, t) 
x 0 


1 


d 
= ro |S aalX/V)G t). (10.161) 


x 


This convolution integral can be disentangled by a Mellin transform as the 
moment of the product is the product of the moments of the functions. 
Define the moments of the distribution (i.e. structure) function 


M(t) = | doga t). (10.162) 


(0) 


Eqn (10.161) then reads 


du” i d 
T= FO | aor |Z Paeon 
(0) x 


1 


= g*(t) | dyy"q(y, t) | dzz"P,,(Z) 
(0) (0) 
= g°(t)M™(t)D™ (10.163) 


where 
1 


D” = | azz aq(Z)- (10.164) 


(0) 
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Thus the parton picture of scaling violations leads to this differential 
equation of the moment of the structure function; its evolution 1s seen to be 
controlled by the t-dependent quark—gluon coupling «,(t). The solution to 
(10.163) can be easily obtained by integration 


dM (t) E D” dt 
M(t) 2b t 


or 


M”) ae ‘a (10.165) 


M”(to) — |.a(to) 


This is precisely the QCD result for non-singlet moments (10.159) if we make 
the identification 


D® = —d, (10.166) 


Thus the moments of the Altarelli—-Parisi function P,, are simply the non- 
singlet anomalous dimension coefficients d. 

When we extend the above analysis to the flavour-singlet combination of 
structure functions, we have the additional feature that the (singlet) gluon 
distribution function G(x,t) also contributes (see Fig. 10.12(b)). 
Corresponding to (10.161), we now have 


1 


d d 
1D L BU) | — [Paa(/y)40; 1) 
4 
+ Pod x/P)G(y, D] (10.167) 
1 
dG d 
EE L go | Z Leola 
a4 


x 


+ Pec(x/y)GQ, t)]. (10.168) 


(Appropriate summation over the quark flavour indices to obtain the singlet 
combination is assumed.) Eqn (10.167) shows that the singlet quark 
distribution varies with ¢, not only because of gluon bremsstrahlung, but also 
because gluons can convert into quark and antiquark pairs in a flavour- 
independent way, etc. The coupled evolution equations then correspond to 
operator-mixing in the more formal approach of using the operator product 
expansion and the renormalization group equation. The moments Paa; Pac, 
Poq, and Pog can be identified with (negative) anomalous dimension 


coefficients d, d'2, dQ, and d&}. 


Perturbation theory and the parton model 


Now we see that QCD results such as (10.159) have a simple interpretation in 
the parton model with quarks and gluons. This suggests that they can be 
obtained directly in perturbation theory without invoking the formal 
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apparatus of the operator-product expansion and the renormalization group 
equation. Before performing this perturbative calculation we shall give an 
overview of the problem of obtaining the parton-model result in perturbation 
theory. 

As we mentioned in the last section, although QCD is asymptotically free 
with a decreasing effective coupling constant g7(q7) ~ (In g?)~', it is still 
difficult to apply perturbation theory directly to many high-energy processes. 
There are at least two reasons for this situation. 


(1) Perturbation theory deals with quarks and gluons, which are not 
physical asymptotic states. Thus, by and large, straightforward applica- 
tions of asymptotic freedom are restricted to calculating gross features that 
are independent of the detailed final-state properties. Quantities such as 
high-energy e*e~ hadronic total cross-sections and deep inelastic IN 
inclusive cross-sections are determined by the initial response of the system at 
short distance-time and are insensitive to the complicated nonperturbative 
process which turns quarks and gluons into hadrons with unit probability. 

(2) Physical quantities usually depend on, in addition to the effective 
coupling g(q*), some mass parameter m. Thus if one attempts to expand 
physical quantities in powers of g(q°) in the large-g? limit, one generally also 
encounters terms of the form [g?(g7) In g?/m?]" which spoil the expansion. 
Physically, sensitivity to m indicates that the large-distance properties of the 
theory are involved. Thus the use of perturbation theory in QCD requires 
that either of the following conditions be satisfied. 


(A) Ing?/m? terms do not occur. This is the case for o,,,(e*e~ 
— hadrons), and o(e*e” — jets) where groups of particles within some 
narrow cones are summed (see §7.2). They are finite in the m — 0 limit and 
therefore are free of In g?/m? factors. 

(B) In q?/m terms can be summed up and the sensitivity to m can be 
somehow factored out. This ‘factorization of mass singularities’ has 
already been briefly mentioned in the previous section and it occurs in the 
case of deep inelastic lepton scatterings. To see the usefulness of the 
factorization property more explicitly, consider a dimensionless ob- 
servable P(g(7), 07/7, m?/u7). Since the value of ¥ cannot depend on the 
scale chosen to define the coupling constant g, we have 


dv 0 ôg ô ôm ô 
— =| u— a —— |¥ = 0. 10.169 
Yau (Hae ee) 
This is the renormalization group equation for the observable. If we can 
establish that  factorizes as Q? — œ 


ae C(g(u?), Q7/u?)D (g(u"), m° /u?”) + O(m?/Q’), (10.170) 


then (10.169) can be used to determine the Q?-dependence 


dv dc dD 
7 = = gee | 10.171 
ET (1 <)p +(x ae ( ) 
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or 
u dC udD | 
Cdu Ddu 
where the separation constant can depend only on g and C satisfies 
the renormalization group equation in the standard form 


y(g) (10.172) 


ô ô 
c z t Ages xoc (g(u°), Q7/u*) = 0. (10.173) 
p g 


From this and y(g) = g’*d we obtain the familiar solution (e.g. see eqn 
(10.159)) 


A 
g’(u") 
=C (0, 1)[1 + bg*(u) n(Q?/u)] P (10.174) 


where we have used the explicit form of g?(Q7) as given in eqn (10.80). We 
note that the function C(g(y), O7/u*) depends on g(u) and Q?/y? only 
through the combination g?(y) ln(Q?/py’). That each factor of g?(u) is 
accompanied by In(Q?/y?) clearly indicates a correspondence to a 
summation in perturbation theory of the leading logarithmical terms, 
which appear as g?"(u?)(In(Q?/p7))". 


We saw in the previous section that factorization of mass singularities is 
accomplished using the operator-product expansion; the c-number singular 
functions then satisfy the renormalization group equations. The present 
discussion then suggests the following correspondence between the formal 
field-theoretical apparatus and the simpler perturbative procedures 


C(g(u), Q*/n7) = C(O, o| 


Formal apparatus Perturbative procedures 
operator product expansion + factorization of mass singular diagrams 
renormalization group equation + summation of leading logarithms 


It should be pointed out that, while this translation table may be helpful in 
clarifying the meaning of our calculational procedures, in most cases no such 
simple separation of steps is possible. In fact factorization is usually not 
achieved until summation of logarithms have been performed. 

Before further discussion of perturbation theory and the parton-model 
results we will first carry out an explicit calculation illustrating the points 
made above. 


Perturbative calculation of deep inelastic scattering 


We are interested in the forward Compton scattering amplitude of a virtual 
photon (with momentum q,) by a quark (momentum p,). Corresponding to 
the operator-product expansion, here we need to identify and factorize 
diagrams containing powers of In p*. Thus we are looking for diagrams that 
are divergent as the mass p? = m? —> 0. There are two types of divergences, 
which are often called infrared divergences and mass singularities. 
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The first type of divergence is brought about in the phase-space region 
when the momentum of the (real or virtual) massless particles vanishes. The 
Bloch—Nordsieck theorem (1937) assures us that such infrared divergences 
cancel in the inclusive cross-section. But all cross-sections are in fact inclusive 
because of the finiteness of energy resolution in all experiments. Essentially 
this situation still holds for the deep inelastic scattering. 

We will concentrate on the divergence resulting from mass singularities. 
These mass singularities occur in theories with coupled massless particles, 
and are due simply to the kinematical fact that two massless particles (with 
momenta p and p’, say) which are moving parallel to each other have 
combined invariant mass equal to zero 


k? = (p + p’? =2EE'(1 —cos@) ~0 as 6-0 (10.175) 


even though neither p, nor p, are soft. This is sometimes called collinear 
divergence. There is also a theorem, by Kinoshita (1962) and by Lee and 
Nauenberg (1964), which can be roughly stated as follows. For inclusively 
enough cross-sections the mass singularities also cancel. The physical basis 
of this theorem is very similar to that of Bloch and Nordsieck; i.e. in physical 
measurements the angular resolution is not perfect and we sum over all 
states within the finite angular resolution. Thus in our calculation we will 
not be concerned with mass singularities coming from undetected final-state 
particles moving parallel to each other, since they cancel. In deep inelastic 
scattering, however, there are still mass singularities left over. They arise 
from regions of phase space where the internal momentum of a massless 
particle is parallel to that of an external massless particle to which it is 
coupled. Consider the one-loop diagrams in Fig. 10.13. Let k = p + p’ be the 


q q 


(b) 
Fic. 10.13. Mass singularities are present in (a) but not in (b). 0 is the angle between p and p’. 
The heavy-lined propagator diverges as 07? in the limit 0 — 0. 


loop momentum. Hence we have a factor of 07? from each of the 1/k? 
propagators, and 0 d0 from the d(cos @) factor in dtk, and finally each 
massless particle vertex contributes a power of 0. Thus Fig. 10.13(a) has mass 
singularity d0/0 ~ In p? and Fig. 10.13(b) is finite. We shall therefore 
concentrate on Fig. 10.13(a) and its generalizations to higher orders. 


One-loop diagrams. It is convenient to use the axial gauge, eqns (9.41) and 
(9.108), 


n,A" = 0 (10.176) 
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where 7, is some four-vector. The gluon propagator in this gauge is given by 


d,y(k) 


Ee) k? + ig 


with 
dk) = Guy — (Kany + Ayn, kn) * + 7k, kk -n)~?. (10.177) 


There is no Faddeev—Popov ghost in this gauge, and the calculation is 
greatly simplified. To take the absorptive part of the amplitude we replace 
the propagator (k? + ie)~! by 26(k*). Thus the diagram in Fig. 10.14(a) 


(a) 
Fic. 10.14. The absorptive part of the forward Compton scattering amplitudes. 


contributes to the hadronic tensor of the inelastic lepton scattering cross- 
section (eqn (7.10)) as 


g’sa(V) | d*k d((p — k)*) 6(q + k)”) 
2M = J (2n)° (k?)? 


W (P, q) m d?°(p = KT wise 


where 
T uvo = 2 tL py Ky, (K + @)y Rye] (10.178) 


and s (V) is the usual colour factor of eqn (10.88) and equal 4/3 for 
SU(3).ciour- It is convenient to use the Sudakov variables (1956) 


k, = Ep, 4+ Bai, He kii (10.179) 
with 
'=q+xp and ki p=kr '— Q0. (10.180) 


We assume that q? x% p° x 0 and that p-q' = p:q, together with —q°, are 
large. 


dtk = 2r(p : q) dB dé dk? (10.181) 


As we shall see, the final In( — q?/p°) term comes from regions of phase space 
corresponding to f x 0 and ¢ = x. Thus the variable x introduced in (10.180) 
has the usual parton-model interpretation as the fraction of the longitudinal 
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target momentum. (This also a posteriori justifies taking q'? ~ 0.) In terms of 
these variables we have 


| d*k 6((p — k)*) lla + k)*) = 2np-q [ap dé dki 


1 k? l l—x k? 
a di ee ee O E T ; 
“q0 — 8 (s iz (: s+) 
(10.182) 


Squaring (10.179) we obtain 


4 
leg 


k? = k? + 2BE(p-q) = k? + k? = k21 — ë). (10.183) 


Therefore (10.182) can be rewritten 


k? 
[ats ôl — k)?) Òa + k)”) = er [as og (6 ~ 2p -) 


2p°q 
Contracting the gluon polarization tensor d°’(p — k) with the trace term 
T uvo (taking y = q') and doing the Dirac algebra yields a leading term 
proportional to the tree graph (eqn (7.53)) 
1+ 1 
ë 5 teLxPy (4 + xp)y,]. (10.185) 
This results from the relative contributions of the first and second terms of 
(10.177) (1 — ëk? and 2ék?/(1 — &), respectively) and from dropping the 
contribution of the third term which is of the order of (k*)* and thus does not 
contribute to the final In(Q?/p”). Collecting terms (10.184) and (10.185), 


2 2 2 
g^ (dk (1+ x*\| x p,py 


Comparing (10.186) with the simple parton-model result of eqn (7.55), we see 
that the one-loop diagram makes the following contribution to the parton 
distribution function f(x) (which is one originally); 


dk? 


x a —~x+(1—x) sa) (10.184) 


d°"(p = K)T juve = (4k) 


Uf (x) = 49°PO) Sr (10.187) 
with 
1+x?\ 1 
P(x) = (25) sr (10.188) 


We choose to write it in a form that will be more useful for interpretation and 
for higher-order calculations 


1 dk? ]{ (dé x 
ya [KSE aoa 
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Thus the diagram in Fig. 10.14(a) with its gluon bremsstrahlung introduces a 
momentum dependence in the quark distribution function having an 
Altarelli—Parisi function of P(x). If we have been careful in keeping track of 
p? terms, we will find the limits of integration for the dk? integral to be 


Q? 

dk? aa 

7 = In(Q*/m’) for p xm? (10.190) 
p? 


Thus the one-loop diagram (Fig. 10.14(a)) has a mass-singular form 


2 
f(x, 0) ~ [g morma (5 ) (10.191) 


Higher-order diagrams. Features similar to those in one-loop diagrams also 
appear in higher-order contributions. The general feature (Gribov and 
Lipatov 1972) is that the dominant contributions in the axial gauge come 
from the ladder diagrams (Fig. 10.14(b)) from the region of phase space 
corresponding to 


p? «k? «k3... «kå <« Q’. (10.192) 


It is understood that in Fig. 10.14(b) the vertex and self-energy insertions are 
included. One of the effects of these insertions is that at each vertex we should 
use the momentum-dependent effective couplings g(k?). 

For each rung of the ladder we have a box diagram similar to the one we 
have just evaluated with the result written in the form of (10.189). Now for 
all the transverse momentum integrals dk? , corresponding to the square 
brackets of (10.189), we have 


Q? ki 
dk{ dk3 
1$°(Q?/m*) = | TAAG j LOR 
ky, 
dk? , 
| ean 


pn, (10.193) 


1 
N! 


Q 
` 1 dk? gk) 4 
N! kI 


Using the asymptotic form of g?(k*), we have 
Q2 
E | din(k?) 1 


Jb in k? = WB [In In(Q?/u?) — In In(m?/u®]. (10.194) 


m2 
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The integral over the longitudinal components of momentum, correspond- 
ing to the curly brackets of (10.189), is of the form 


dé, (dé, dé, x 
IPV = | — | =... | — ô| 1 -—-— 
[$ EZ z) 

Ča Č3 0 


x P(E,)P(E2/E1)--- P(Cn/En-1)- (10.195) 


The fraction of momentum p carried by quark ï is č; and that by quark i + 1 
is €,, ,. The argument of the Altarelli—Parisi function is the ratio of these two 
fractions €;, ,/€;. This equation is in a multiple-convolution form and we can 
take moments to simplify the result 


1 1 


| airo = f dez"P(2)| = [d)]”". (10.196) 
0 0 


Then the moment of the structure function coming from the N-rung ladder 
diagram is 


l 
MP’) = =, Lead)" (10.197) 


Summing over N we get 


M(Q?) = $ MPQ?) = exp[pd”] 


qd™ 
= exp on [In In(Q?/p*) — In In(m*/p") i 


m [In(Q?/p?)]2"22 [In(m?/p7)] —d(n)/2b (10. 198) 


This shows the factorization property that M”(Q*) decomposes into a 
function of Q7/u* multiplied by a function of m?/u?. Note that this 
factorization takes place only after diagram summation; individual terms do 
not factorize. 
Finally let us check to see whether d agrees with the result obtained 
previously. From (10.196) and (10.188), 
1 1 P 
d™ = | dz2"P(z) = — | Pd cies) (10.199) 
31 l-z 


0 0 


which actually diverges as z > 1. More carefully analysis shows that when we 
did the calculations for Fig. 10.15(a) we should also have included the vertex 
correction diagram (Fig. 10.15(b)) for the Bloch—Nordsieck infrared diver- 
gence cancellation to take place. This corresponds to the replacement 


sO 4, (f@-s/ 4 
l-z l-z 


(10.200) 
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Fic. 10.15. 


Thus d™ can be calculated 


l fa +z’) -2 


l-z 


-3a | Z0 te = 204) = 7 
37 x 


0 


l 2 Tod 
a Í EE + 1 Hl (10.201) 
Thus by summing leading logarithms we obtain exactly the same results as 
obtained by using the more formal procedures of renormalization group 
equations. 

Once we understand the ideas and calculations for the familiar deep 
inelastic case, it is conceptually fairly straightforward to generalize the 
scheme to other hard scattering processes. The real advantage is that in most 
such hard scattering reactions a direct application of the operator-product 
expansion is not feasible. Thus we can use the perturbative QCD approach 
to calculate the Drell-Yan process of inclusive production of lepton pairs 
with large invariant mass in hadron collision (see §7.3). Here factorization is 
achieved and one finds the same parton distribution functions there with 
momentum dependences as those in the deep inelastic case. Another 
outstanding example of the perturbative QCD calculation of physical 
quantities that are not ‘infrared sensitive’ is the prediction of the final-state 
angular distribution (jets) (Sternman and Weinberg 1977) and energy flows 
in high-energy e*e annihilations into hadrons (Brown and Ellis 1981). 


10.5 Lattice gauge theory and colour confinement 


Quantum chromodynamics has the remarkable property of being asymptoti- 
cally free. The vanishing for short distances of the effective coupling gives the 
correct description of Bjorken scaling (and its violation by logarithms). The 
same statement also suggests that the effective coupling increases for long 
distances and this points towards a possible resolution of the central paradox 
in the phenomenological quark picture, i.e. quarks must behave like free 
particles for short distances while they must also be completely confined on 
long time and length scales. Of course strong coupling itself is not enough to 
explain quark confinement. To do so one must show that in QCD the particle 
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spectrum is realized in terms of bound states of quarks and gluons, etc. In the 
following we shall first give a brief qualitative discussion of how quark 
confinement might come about in the asymptotically free gauge theory. This 
should motivate the lattice method of defining such a theory and we will then 
demonstrate that the confinement property can be obtained in a strong 
coupling approximation of the lattice gauge theory. 


Qualitative picture of confinement 


Qualitative ideas about the nature of confinement that have some correct 
physical consequences tend to picture quarks as being bound by ‘strings’ 
(Nambu 1974) or tubes of colour flux. It has been suggested that the QCD 
vacuum is a condensate of gluons and as well as light quark—antiquark 
pairs. (For a review see, for example, Mandelstam 1979.) This is somewhat 
analogous to the ground state of a superconductor. There the condensate of 
paired electrons gives rise to the Meissner effect of magnetic flux exclusion 
from the condensate unless the energy balance favours a local breakdown to 
the normal phase. And one can imagine placing a pair of magnetic 
monopole and antimonopole into this superconducting medium; the mag- 
netic flux will be confined to a string-like configuration joining the pair of 
monopoles. Analogous to this situation the energetically favoured configura- 
tion of, say, a quark and an antiquark, has a strongly localized normal 
region connecting the pair in which the colour (electric) flux lines are 
restricted (see Fig. 10.16). By translational invariance the energy density of 


—— <—- 
=== 


Fic. 10.16. Lines of force between a quark and an antiquark. When the quarks move away 
from each other, the breaking of the string is accompanied by further quark—antiquark pair 
production. 


the gluon field along the flux tube is a constant. Hence the total field energy is 
linearly proportional to distance. This means the quarks are confined by a 
linearly rising long-distance potential 


E(r) > Kr (10.202) 


where r is the separation between the sources and K is a constant referred to 
as the string tension. Thus after the production of a quark-antiquark pair 
(say, in e*e7 annihilation) it becomes energetically favourable with increas- 
ing separation for the string to break and produce another quark—antiquark 
pair, and so on. This proceeds until the original string is broken down into 
several strings of length typical of the hadronic size of ~ 1 fm (~5 GeV~*) 
corresponding to the original quark pair being converted into a whole set of 
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hadrons having a typical energy-mass of 1 GeV. Thus we can crudely 
estimate the string tension to be 


K ~4GeV?. (10.203) 


The string model for hadrons has another consequence in agreement with 
observation: hadrons lie on Regge trajectories (Regge 1959) with slopes 
~ 1 GeV~?. A decade of intense study of hadron dynamics through their pat- 
tern of particle exchange in which unitarity and the analyticity property of 
the scattering amplitudes play a crucial role has led to many insights into 
particle interactions. One of the important discoveries in this S-matrix 
approach (Chew 1962) was that hadrons of a given internal symmetry 
quantum number but different spins obey a simple spin(J) — mass(M,) 
relation (Chew and Frautschi 1961); we say they lie on Regge trajectory 


J = % + a’ M?, 
with 
a’ ~ 1GeV~?. (10.204) 


Now imagine two massless (and, for simplicity, spinless) quarks, connected 
by a string of length d, rotating with the speed of light (Gasiorowicz and 
Rosner 1981). Thus each point, at a distance r from the centre, has the local 
velocity v/c = 2r/d. The total mass is then 


d/2 
Kdr nK d 
0 


and the total angular momentum is 


d/2 


2 
ian Krv dr nK d 


0 


Thus the string tension K of (10.202) can be expressed directly in terms of the 
Regge slope «’ of (10.204) 


7 1 
— 2na 


(10.207) 


The experimental value «’ in (10.204) leads to a K = 0.2 GeV? in qualitative 
agreement with the rough estimate of (10.203). 


Field theories on the lattice 


In order to study these long-distance properties of QCD we need to 
regularize the theory that is independent of the usual Feynman diagram 
expansion, which is appropriate for weak couplings. For this purpose Wilson 
(1974, 1975) introduced the lattice gauge theory in which the space-time 
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continuum is discretized. This provides a natural cut-off scheme as 
wavelengths shorter than twice the lattice spacing, a, have no meaning and 
this restricts the domain of momenta to a region bounded by z/a. Also this 
formulation of the field theory allows for a close analogy with a statistical 
mechanics system. So we can call upon all our experience and intuition of 
statistical mechanics to solve problems in quantum field theory. With a cut- 
off on high momenta the kinetic energy is bounded; we can then treat it as a 
perturbation in the strong coupling limit. This corresponds to the method of 
high-temperature expansion in statistical mechanics. With a finite lattice 
there are a finite number of variables. It is then possible to study various 
physically interesting quantities (e.g. energy spectrum, correlation functions, 
etc.) in the path-integral formalism by computer simulation based on the 
Monte Carlo method. 

As with any cut-off prescription, considerable freedom remains in the 
lattice formulation. In the limit of vanishing lattice spacing, the physics of a 
renormalizable field theory should be independent of the details of the 
regulator; it should, so to speak, lose the memory of the lattice spacing. This 
means that in this limit the coherence length of the theory should be infinite 
when compared to the lattice spacing. In the language of statistical 
mechanics the divergence of the correlation length corresponds to a second- 
order, or continuous, phase transition. If a model has only a first-order 
transition, the coherence length never becomes infinite and the desired 
continuum theory does not exist. Furthermore, analytic results in the strong 
and weak coupling regimes can easily be established. We need to ascertain 
whether these properties are connected continuously in the theory, i.e. 
whether there are phase transitions at intermediate couplings. Thus it is 
important to study the phase structure of lattice field theory. At present most 
of our knowledge of the phases of QCD is obtained in various numerical 
studies of the theory. 

There are two popular methods for introducing the lattice in field theory. 
In the Euclidean lattice formulation both space and time are discretized 
(Wilson 1974). Here after a Wick rotation (to Euclidean space) the 
quantization is performed via the path-integral formalism. This method has 
the advantage of keeping some vestige of the original Lorentz symmetry and 
this allows for a particularly elegant formulation. Also, as we shall see in 
the case of lattice gauge theory, there is no need to introduce the gauge- 
fixing terms. The other method is the Hamiltonian lattice formulation in 
which only the spatial dimensions are discretized in a Minkowski space-time 
(Kogut and Susskind 1975a). The theory can be canonically quantized with 
the usual Hamiltonian formalism. This has the advantage that some physical 
quantities (especially the mass spectrum) are more directly accessible to 
computation. Our discussion will follow the Euclidean formulation with a 
space-time lattice spacing a. The lattice site will be labelled by a four-vector 
n. The four-dimensional integration will then be replaced by a sum 


[ats ay (10.208) 
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Scalar fields. We first study the simplest case of scalar field @(x). The con- 
tinuum field theory in Euclidean space has the action 


S(p) = [aneo + V(¢)] (10.209) 
where 
À 
V() = =m? + 4 ht. (10.210) 
The scalar field @ exists at every lattice site n, 
p(x) > dy. (10.211) 
The derivative is replaced by 
l 
ô P(x) TA a (Pasi 7 Pn) (10.212) 


where jis a four-vector of length a in the direction of u. The lattice action is 
then 


S(¢) =) 5 2 (Onti — Pn)? + “(> p? + . os) (10.213) 


It is instructive to go over to momentum space to see the spectrum of the 
(A = 0) free-field theory. For this we take the Fourier transform 


d*k . 
al j ayi oH). (10.214) 


Since it is meaningless to consider wavelengths less than twice the lattice 
spacing, the above integral is taken over only one ‘Brillouin zone’ of the 
reciprocal lattice. Thus, 


2 <k,< id for each u (10.215) 
a a 


where k, = k:u. After substituting eqn (10.214) into eqn (10.213), we have 
factors such as those coming from the kinetic energy term 


d*k d*k’ . : l 
4 i(k+k')'n(aiak, __ 1 iaka l 
wd | OntQne® C rT DeD 


dtk 
= |S (e'u _ 1)(e7 iku Pe 1) 


dtk . 
=4 las sin? (ak,/2). 
The free field action can then be written 


1 { d*k 4 . (ak, 5 
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Thus each mode contributes to the action in the momentum space a 
quantity 


See ee ae sin(a) 
ra 2 


rather than the standard m? + k*. Nevertheless, these two expressions have 
the same continuum limit as they coincide at the minimum value of k = 0 (see 
Fig. 10.17(a)). 


(a) 


Fic. 10.17. The dispersion relation S(k) for free (a) scalar and (b) fermion systems. The dashed 
curves k* are for familiar continuum theories. The solid curves correspond to the latticized 
systems. 


The lattice action (10.213) can be quantized by using the Feynman path- 
integral formalism in which the expectation value of a product of fields is 
given by 


l 
COlOn,Pnz-+ - Pn l0> = 7 [t [db }(bn,Pn, -- -ne (10.217) 
where 
Z = jo [d¢,]e >. (10.218) 


The meaning of the integrals should be clear as we may recall that the usual 
functional integrals are actually defined on a discretized space-time lattice 
and an appropriate continuum limit is taken at the end. If we rescale the 
fields as 


On = VAGns (10.219) 


the lattice action becomes 
l 
S($) = ~ S9) (10.220) 


where 


2 


: l 
so) =F É E (Orra — $n)? + at (= 245 os) (10.221) 


n 


i.e. the coupling constant å has become an overall factor in the action. In this 
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way eqns (10.217) and (10.218) may be written as 


I 
OlPn Pn- On l0) = z i KOA LOR E Pn) 
x api so} (10.222) 


l 
Z' = {I [d¢,,] exp) = so} (10.223) 
We note that (10.223) has the same structure as the partition function in 
statistical mechanics if we make the identification 


l l 


M 


(10.224) 


Thus the strong coupling expansion (in powers of 47+) will correspond to the 
high-temperature expansion in statistical systems. 


Fermion fields. We now consider fermion fields on a lattice. The same 
procedure will yield a Euclidean lattice action for the free-fermion system 


3 4 
SoY) = X P 2 WnYal Wna is Wn—a) = manda} (10.225) 


The y matrices are Euclidean, that is 
Yur Vy} = 2 Ouy: (10.226) 


Eqn (10.225) has the momentum space form 


d*k sin ak 
S = |— W(—-k)s1 
which yields a dispersion relation 
. 2 k 
S(k) = > Atm? (10.228) 


shown in Fig. 10.17(b). There are now two equal minima in a Brillouin zone. 
One is located at k = 0 and gives the correct continuum limit. The other 
mode at k = +7z/a carries an infinite momentum as a —> 0 and yet can be 
excited for finite a. The fermion degeneracy, i.e. the doubling of the number 
of fermionic states, must be suppressed with some appropriate modification 
of the latticized theory. This is permissible as long as the continuum limit is 
not affected. Clearly many degeneracy regularization procedures are 
possible; we will present the one due to Wilson, who proposed the simple 
addition of a non-local factor of 


AL = Pansa + Vann — Ur) (10.229) 
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to the lattice Lagrange density. Thus the free fermion action takes on the 
form in Euclidean space 


n 


+ (1 a Yan- — 2p] Fe ma Fn (10.230) 


which has the momentum-space representation 


d*k _ in ak 
So(W) = [Soro id 7, ne 
tims rA yh, (10.231) 


This increases the unwanted minimum without affecting the small-k 
behaviour and the k = 0 minimum will be only one to survive on the 
continuum limit. 


Local gauge invariance and the QCD action 


As we have illustrated above, considerable freedom exists in lattice 
formulation. One is free to add to the Lagrangian terms which will not 
contribute in the continuum limit. Using this freedom, Wilson has presented 
a particularly elegant lattice formulation for gauge theories. His prescription 
keeps local gauge invariance as an exact symmetry in a mathematically well- 
defined system. 

Recall our discussion in §8.2 of the geometric interpretation of gauge 
invariance. When a material particle undergoes a parallel transport along a 
world line C from x to x’, it can be represented by a ‘non-integrable phase 
factor’ of the wavefunction (i.e. for every path we can associate a group 
element as in eqn (8.65)) 


U(x’, x) = exp ig fr ‘A,(y) ay (10.232) 
C 
Thus, for a gauge transformation, specified by the gauge function 
O(0,) = exp{iT :8(x)} (10.233) 
we have 
Yx) > DOA) (10.234) 
W(x) > WADO) 
and 
U(x', x) > O(6,.)U(x’, x)®"(0,.). (10.235) 


(Please note the notation changes for the gauge transformation and parallel 
transport matrices from the notation used in §8.2.) 
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The lattice version of gauge transformation in (10.234) and (10.235) may 
be written as 


WV, > OW, 
Y, > w,o' (10.236) 
and 
U(n + Â, n) > ©, ,,U(n + fi, n)®} (10.237) 
where, for the SU(3) gauge symmetry, 
Ae. 
0, = epli J at (10.238) 
with 4f, i = 1, 2,..., 8, being the usual Gell-Mann matrices and 
WE 
U(n+ fi, n) = exp ig z Aiah (10.239) 


This is the lattice version of the parallel transport matrix between adjacent 
sites n —> n + pf and is usually called a link variable. From (10.236) and 
(10.237) we have „U (n,n + fi)W,+4, as a gauge-invariant combination. This 
suggests the modification of (10.230) to obtain the quark part of the SU(3) 
gauge-invariant QCD action 


Soco = S(q) + S(A) (10.240) 
3 
S@=> P S Pal + y,)U (n,n + Ânn. 


n 


+ (1 = y,)U@, n — Apni + 2p] — ma Vat (10.241) 


What about the lattice action for the gluon field S(A)? Clearly this term must 
be composed of link variables only. The simplest gauge-invariant combina- 
tion will be four-link variables. (As they are matrices in SU(3) space two- 
and three-link combinations are trivial because of unitarity and deter- 
minantal constraints.) This suggests that 


—] 
S(A) = 25 X tr U, (10.242) 
29° S 


with 
U,=U(n, n+ U(n+ fi,nt+ f+vU(n+ ft+v,n+ v)Uln +v, n) 
(10.243) 


which is a product of four-link variables around an elementary square 
(called a plaquette; see Fig. 10.18). The sum in (10.242) is over all plaquettes 
of the lattice. From our discussion of a parallel transport around a square 
given in §8.2, it follows that (10.242) may be written as 


Bea 1 , 
S(A) = ay >» tr{exp(iatgF,, „)} (10.244) 
Pp 
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n+v n+utv 


n n+p 
Fic. 10.18. A lattice plaquette. 


with 
Eas = Oars ~ OA nu i igl An, n» Aiad 
l 
E 7 (Anta v — Any) (10.245) 


where A,, , is the gluon field A‘/'/2 at the site n. The continuum limit can be 
recovered immediately as 


sy -aty hi -Lgtr pie 4 
52 5 g Fas Bec 
l ne 
= | d*xFi Fie (10.246) 


where we have used the fact that the As are traceless (so the linear F,,, term 
disappears) and tr(/'A’) = 2 6%. 


Confinement criteria, the Wilson loop 


To see whether QCD has confinement one can study the energy of a system 
composed of a quark at x = (t, 0) and an antiquark at x = (t, R). If there is 
no confinement, we expect 


E(R)-> 2m for R> œ (10.247) 


where m is the quark mass. Confinement implies that the interquark potential 
energy grows without bound, 


E(R)> æ for R->o. (10.248) 
We can represent the q state at time ¢ as (see, for example, Bander 1981) 
la, O4, R)> = X, f (CEC, R), (2, 0); CJ10> (10.249) 
C 

where I'[x’, x; C] is the gauge-invariant qq operator 

TEx’, x; C]= qx) UC, x; C)q(x) (10.250) 
with 

U(x', x; C) = exp4ig | XAO) a} (10.251) 


x 


and C is a path joining x and x’. Now consider the overlap between the qq 
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state at ¢ = 0 and the qq state at t = T, 

Q(T, R) = <OIT'E(O, 0), (0, R); CIT[(T, 0), (T, R); CJO>. (10.252) 
Inserting a complete set of energy eigenstates, we obtain in the Euclidean 
space 

Q(T, R) = > |<O|T"L, 0), (0, R); C]|n>|? e~ 7. (10.253) 


For large T, the smallest E, will dominate. This smallest energy eigenvalue 
corresponds to the potential energy of the qq system separated by a distance 
R 


lim Q(T, R) ~ eo 2", (10.254) 
T> 0 
In terms of the quark fields the overlap function Q(T, R) may be written as 
Q(T, R) = ¢0|q(0, R)ULO, R), (0, 0); C}q(O, 0) 
q(T, 0)U [(T, 0), (T, R); C]q(7, R)]O>. (10.255) 


If we treat the quark fields as external sources (as in the case of heavy quark 
states), then the quark propagator (in a background of gluon fields) can be 


expressed as 
t’ 


<Olg*(t’, x)G*(t, x)]0> = exp i | Ao(t, xX) ar [core s X)AŽ(E, X)0 free 


t 


~ U[(t', x), (t, x); C] ô? e™™"". (10.256) 
Combining (10.256) and (10.255), we have 
Q(T, R) ~ e7?""W(C) (10.257) 
where 
W (C) = <Oltr U[x, x; C]|O>. (10.258) 
Here C is the rectangular path of Fig. 10.19. The correlation function W(C) 
(7,0) (T,R) 
(0,0) (0, R) 


Fic. 10.19. A Wilson loop. 
is called the Wilson loop and its behaviour determines the confinement 
property, as a comparison of (10.257) with (10.254) shows that 
lim W(C) ~ e7 TER -2m (10.259) 


T= œ 


As we shall illustrate presently, in the strong coupling limit, the Wilson loop 
of lattice gauge theory obeys an area law so that for a large contour 


W(C) ~ exp{ — KA(C)} (10.260) 


where K is a constant and A(C) is the area of the surface enclosed by path C. 
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Thus for the rectangular path of Fig. 10.19 
A(C) = TR. (10.261) 


Substituting eqns (10.260) and (10.261) into eqn (10.259) we have the linearly 
rising potential (10.202) for the qq system. Also, we can identify the constant 
K in (10.260) as the string tension. Note that we treat the quarks as a mere 
external colour source. Thus using the Wilson loop it is possible to study the 
question of colour confinement in a pure gluon theory without quark fields. 


The area law in the strong coupling limit 


In eqns (10.222) and (10.223) we displayed the path-intergral formulation of 
the correlation function in a scalar field theory. The Wilson loop may be 
similarly expressed in a pure gluon QCD; it will be a functional integral over 
gluon fields. Since the (infinitesimal) link variables are directly related to the 
gauge fields, we can choose to work with the link variables as the basic 
dynamic degrees of freedom in a lattice gauge theory. As we shall see, this 
allows for a simpler formulation of eqn (10.258) 


l 1 
WO) = 5 [| dU@, n + Ô) tr U(x, x; ©) api- sr Dt Up}. 
n, u g P 


(10.262) 
where 


Z= [1 dU(n,n + ji) exp) — ar Dt Vp}. (10.263) 
n, u P 

Note that no gauge-fixing term has been added to the action because the 
link variable (i.e. lattice gauge field) has only finite range. The volume in the 
path-integral space generated by all possible gauge transformations is finite. 
Hence no convergence factor corresponding to the gauge-fixing term will be 
necessary before passing on to the continuum limit (see §9.1). The link 
variables are group elements of SU(3). We may set up a generalized Euler 
angle representation for these SU(3) unitary matrices and determine 
explicitly the form of the group integral in terms of the eight Euler angles. In 
any case we have the orthogonality properties 


| dU(n,n + ALU, n + Aly = 0 (10.264) 
[avo n + ÒCU(n, n + AJyLU'n, n + Da =3 ôu (10.265) 


favo n+ ALU(n, n + ALU (n, n + Aa = 0. (10.266) 


The identity (10.265) implies that two links going in opposite directions will 
cancel each other after integration. Thus, if we have two adjacent plaquettes 
oriented in the same direction, they will merge into a rectangular path after 
integrating over the common link as illustrated in Fig. 10.20. In the strong 
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n+p 
| mete {4 +} ; | 
n 


Fic. 10.20. 


coupling limit we have 1/g? as a small parameter and the exponential in 
(10.262) can be expanded 


W(C) = 5 [avo n + pt) tr U(x, x; C) 


l 1/1/\ 
[i= se dry, +35 (5) LL Uptruy, +...| 
(10.267) 


For simplicity we will take a planar rectangular path C as in Fig. 10.19. It 
is clear that in this strong coupling limit the orgothonality properties of the 
link variables given in eqns (10.264)-(10.266) imply that the lowest-order 
nonvanishing contribution to W(C) is the (1/g*)"” term where N, is the 
minimal number of plaquettes required to cover the area enclosed by the 
path C, 


1 \* 
W(C) ~ (=) (10.268) 
g 
This corresponds to the area law since the area enclosed by the path C is 
given by 
A(C) = a?N,. (10.269) 


Hence, 


W(C) ~ (g? 40 = exp{ —(TR In g°)/a?} (10.270) 


or, comparing this with (10.260) we have the linearly arising potential 


E(R) = KR (10.271) 
with 
In g? 
K= at (10.272) 


The weak coupling expansion of the Wilson loop can also be considered by 
first passing to the continuum limit and replacing the action by the Gaussian 
approximation. We then obtain a perimeter law. It turns out that this is the 
familiar Coulomb’s law, E(R) ~ R™', in disguise. With the appropriate 
renormalization effects taken into account the property of asymptotic 
freedom is recovered. 
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Does this mean that we have proven that QCD possesses both properties 
of asymptotic freedom and colour confinement? No, not yet, as we note that 
the U(1) lattice gauge theory also has both the perimeter (i.e. Coulomb’s) law 
and the area law (confinement)—the above strong-coupling result 1s derived 
without using the non-Abelian nature of the theory. The catch is that the 
weak and strong coupling regimes may be separated by one or more 
discontinuous phase transitions. It has been proven that in four dimensions 
the Abelian gauge theory has a non-trivial phase structure, but similar 
analytic proof that QCD does not undergo a phase transition at some finite 
couplings has not been obtained. However this problem can be investigated 
using numerical analysis. In particular two approaches have been very 
successful. The first involves the use of Pade techniques to extrapolate from 
the strong coupling expansion to a regime where weak coupling predictions 
become valid (Kogut, Pearson, and Shigemitsu 1979). The second approach 
is Monte Carlo simulation in which the path integral (10.263) is considered as 
a partition function for a statistical system (Creutz 1979). Various correla- 
tions can be calculated by first generating configurations typical of the 
system in thermal equalibrium. The results from both methods indicate that 
no phase transitions occur in the intermediate coupling region. The strong 
coupling behaviour (10.272) g?(a) ~ e“” does go into the weak coupling 
g*(a) ~ 1/0n a~') as a > 0. It is found that the transition is smooth and 
rapid (see also Kogut et al. 1981), just as a similar calculation for U(1) theory 
shows convincing evidence for a phase transition. Furthermore, these 
methods are able to produce satisfactory numerical relations between the 
long-distance parameter string tension and the gauge coupling at short 
distance (or the conventional QCD scale parameter, 100 MeV < A < 
300 MeV). Thus these numerical results are encouraging in indicating 
that asymptotic freedom and colour confinement do coexist in a single phase 
of QCD. 


11 Standard electroweak theory |: 
basic structure 


IN this chapter and the following one we shall present the standard gauge 
theory of weak and electromagnetic interactions. It combines in one 
framework quantum electrodynamics and the low-energy V—A theory of 
weak interactions for charged currents. The unified theory is renormalizable. 
It also predicts a new set of neutral currents; its successful experimental 
confrontation in recent years strengthens our confidence as to the correctness 
of the theory. Chapter 11 will serve as an introduction and will emphasize the 
theoretical structure of the model. In Chapter 12 the phenomenological 
implications of the model will be presented. Possible extension and 
modification of the standard electroweak theory will be discussed in Chapter 
13. 


11.1 Weak interactions before gauge theory 


In this section we shall provide a brief review of weak-interaction theories 
before the advent of gauge models: the four-fermion theory as well as the 
intermediate vector boson theory. We shall discuss in particular the 
difficulties encountered if they were taken to be self-contained theories. 


Four-fermion interactions 


Soon after Pauli’s neutrino postulate, Fermi (1934a,b) proposed his theory 
for the B-decay n > pev: 


Gr __ E 
Lex) = =- BOE) vO] + h.c. (11.1) 
where the fermion field operators are denoted by their particle names, and 
Gr =~ 10° °/m; (11.2) 


is the Fermi coupling constant with m, being the proton mass. 

In the ensuing years other processes such as the n-p and p-e decays have 
been discovered and found, like B-decays, to have a comparatively long 
lifetime. The concept of a distinctive class of interactions, the weak 
interactions, began to emerge. The surprising discovery of parity non- 
conservation (Lee and Yang 1956; Wu et al. 1957) stimulated a great deal of 
research and the eventual formulation of the V-A theory (Feynman and 
Gell-Mann 1958; Sudarshan and Marshak 1958; Sakurai 1958). It 1s 
suggested that an effective Lagrangian, very much like the one in eqn (11.1), 
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describes the weak interactions 


G 
LX) = e J‘ (x) Jx) + h.c. (11.3) 
with the weak current J (x) being of the vector-minus-axial-vector form. 
If we separate in the current the leptonic and hadronic parts, 


J y(x) = Jig) + nal) (11.4) 
the leptonic current J}(x) can be written directly in terms of lepton fields 
Ji(x) = Vey (l — ys)e + Wy*(1 — ys)p (11.5) 


and the hadronic current J/(x) can be decomposed into parts having definite 
flavour SU(3) transformation properties as in eqn (5.79) and can be neatly 
summarized by writing the current directly in terms of quark fields 


Ji(x) = ay*(1 — ys) do (11.6) 
with 
d, = cos 0, d + sin 6, s (11.7) 


where 0, is the Cabibbo angle ~ 13°. When compared eqn (11.6) to (11.5), 
lepton—quark symmetry then suggests the generalization of (11.6) to 


Ji(x) = üy (1 — ys) dg + Cy*(1 — Y5)So (11.8) 
with 
Sọ = cos ĝ, s — sin 0, d (11.9) 


where the c-field is the postulated new heavy quark, the charmed quark 
(Bjorken and Glashow 1964). More importantly, it has been shown that any 
sensible weak-interaction theory must have this extra hadronic current in 
order to suppress to an acceptable level the induced strangeness-changing 
neutral-current effects (Glashow, Iliopoulos and Maiani 1970). This suppres- 
sion mechanism, although invented before the general acceptance of gauge 
theories, can best be explained in this new context and we shall do so in §11.3 
and in more detail in §12.2. 

We should take note of some of the common properties of the weak 
currents as given by eqns (11.5) and (11.8). They are charged currents, all 
with one unit of charge; in the lowest order there are no neutral-current 
processes such as the reaction v, N —> v,N. They are bilinear in the 
fundamental fermion fields involving the helicity projection operator 
(1 — y;). We can rephrase this by saying: that only the left-handed (LH) 
fermions are present in the weak currents. Thus, 


Wy*(1 — ys) = Ay h (11.10) 
with 


L=30l—ys)W, Wr=3l + y5)W (11.11) 
and 


w= t+ Wr. (11.12) 
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The rule that ‘parity violation is maximal in weak interactions’ has a simple 
interpretation in terms of the lepton and quark fields: in charged-current 
weak interactions the fundamental dynamical degrees of freedom for matter 
are the two-component left-handed fermion fields. 

If we restrict our applications to the leading order in Grp, the V-A 
Lagrangian of (11.3) is very successful in describing a vast amount of low- 
energy weak-interaction experimental data. The exceptions are few. It is not 
clear whether nonleptonic weak decays are correctly described by a 
J,,(x)J#(x) coupling as required by (11.3) because of our inability to do 
reliable strong-interaction calculations. The CP violation discovered in the 
neutral K-meson system (Christenson, Cronin, Fitch, and Turlay 1964) 
cannot be incorporated in any simple fashion. The most successful pheno- 
menological theory of CP nonconservation postulates the existence of a new 
super-weak interaction (Wolfenstein 1964). In short, the lowest-order V-A 
theory correctly describes the domain of weak-interaction phenomena in 
which one believes it should be applicable. 

Nevertheless the Lagrangian in (11.3) cannot be taken as a self-consistent 
quantum field theory of weak interactions: it is not renormalizable and even 
in the lowest order of Gp it violates unitarity at high energies. 


(1) Lack of renormalizability. The interaction in (11.3) 1s not renormaliz- 
able. It is a dimension-six operator, or, more transparently, the coupling 
constant Gr has dimension (mass)~*. Thus the higher-order contributions 
are increasingly divergent and they cannot be organized in such a way as to 
be absorbed in a few ‘bare’ quantities (see Chapter 2, especially §2.4). In a 
nonrenormalizable theory, even though the coupling may be small, there is 
no guarantee that the higher-order terms are not large. Then the whole 
corpus of selection rules (e.g. the absence of AS > 1, AS = — AQ charged- 
current processes) which is the basis of the successful V—A theory will be 
made meaningless if the lowest-order Born terms do not dominate. 

(2) Violation of unitarity. Even if we restrict ourselves to the Born 
approximation there are certain processes which will violate unitarity. 
Consider the reaction of v, e > pv, as described by the effective Lagrangian 
of (11.3): the amplitude has only the J = 1 partial wave and the high-energy 
cross-section o ~ GẸs, with s = 2m,E (E being the v, energy in the laboratory 
frame). However unitarity requires that o(J = 1) be bounded by s~*. Thus 
for energies above ./s ~ G;‘/* ~ 300 GeV the theoretical cross-section 
from (11.3) would violate unitarity. (For details see §11.2.) 

It turns out that these two problems are closely related. If the lower-order 
diagrams have bad high-energy behaviour to violate unitarity, the higher- 
order contributions, which are integrals over lower-order diagrams, cannot 
be renormalized. 


Intermediate vector boson theory (IVB) 


Like the electromagnetic current, the weak current in LY. (eqn (11.3)) 
transforms as a four-vector under the Lorentz transformation and one may 
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imitate the successful QED by introducing a new massive field W, and write 
the basic interaction as 


LY, = g( J,W" + hc). (11.13) 


Then the four-fermion Lagrangian (11.3) can be viewed as the effective low- 
energy theory generated by & in second order (see Fig. 11.1) with the 


g 
: ox omen 
W —_—_ 
£ 


Fic. 11.1. The four-fermion interaction as the low-energy approximation to the g? diagram in 
IVB theory. 


identification g*My? = 27t Gp, as the massive IVB propagator con- 
tributes the My” factor. Since QED is a gauge theory, one may interpret 
this interaction Lagrangian as resulting from a Yang-Mills construction— 
but with gauge bosons being massive. 

Let us now examine the problems of unitarity and renormalizability in IVB 
theory. The problem with unitarity remains: but it is shifted to some other 
processes. For example the reaction vý = W* W_- with longitudinally polar- 
ized Ws can be shown (see §11.2 below) in the IVB theory to have a high- 
energy amplitude ~ GpE? (E being the neutrino energy) in the pure J = 1 
partial wave, which unitarity requires to be bounded by a constant. 

Even though the coupling g is now dimensionless, this theory is still not 
renormalizable. The free massive vector boson Lagrangian is 


Ly = —4}4 (ô Wt — ô ,WE(OFW” — 80W”) + Me WIW” (11.14) 
which gives the propagator in momentum space 


. . Juv ~ k,k,/M$ 
iA „ (k) = —i Pai (11.15) 
This propagator behaves like a constant as k — oo and the interaction is not 
renormalizable by power counting. The problem of course lies in the IVB 
mass, which gives the term k,k,/M¥ in the propagator. However just this 
massiveness of W is required in order to yield the desired low-energy four- 
fermion theory. The key problem is then how to introduce gauge-boson mass 
terms in the Yang-Mills theory without spoiling renormalizability. 


1.2 Construction of the standard SU(2) x U(1) theory 


As discussed in the last section, weak interactions must involve massive 
intermediate vector bosons and yet their mass terms spoil renormalizability. 
This serious contradiction was finally resolved with the emergence of the 
spontaneously broken gauge theory. Also, the correct renormalizable theory 
displays the unity of weak interactions with electromagnetism. The required 


340 Standard electroweak theory I: basic structure 11.2 


gauge symmetry is SU(2) x U(1). This model is now the standard theory of 
the electroweak interaction. 

Noting the vectorial nature of both interactions, Schwinger (1957) was the 
first to advance the idea of weak and electromagnetic unification. Glashow 
(1958) suggested that the desired renormalizable theory of weak interactions 
would involve this unification, and later he (Glashow 1961) proposed such a 
model which has the SU(2) x U(1) gauge symmetry. Renormalizability is 
not preserved in his theory as the IVB masses were inserted by hand. A 
similar attempt was also made by Salam and Ward (1964). Finally the 
renormalizable theory with IVB masses generated by the Higgs mechanism 
was proposed by Weinberg (1967) and also discussed independently by 
Salam a year later (1968). Thus the standard theory is often referred to as the 
Weinberg—Salam model or the Glashow—Weinberg—Salam model. However, 
the importance of this approach was not recognized by the general 
community of high-energy theoretical physicists until °t Hooft (1971la,b) 
transformed the subject by proving the renormalizability of gauge theories, 
with and without spontaneous symmetry breaking. 

In the following discussion we shall present the Weinberg—Salam theory. 
For simplicity we shall restrict our initial consideration to the lightest 
fermion ‘family’ or ‘generation’ (v,, e, u, d). As we shall show, a theory in the 
one-family approximation is completely self-contained and self-consistent. 
The heavier families (v,, p, c, s) and (v,, T, t, b) will be given structures 
identical to the light one. Their incorporation into the standard theory is 
discussed in §11.3. 


Choice of the group SU(2) x U(1) 


The algebraic approach. To motivate the choice of the gauge group we need 
only to consider an IVB theory with an electron and its neutrino. The weak- 
interaction Lagrangian is given by eqns (11.13) and (11.5) 


Ly = G(J,W* + h.c.) (11.16) 
where 
J, = Vey,(1 — Y5) e (11.17) 


is the V-A charged current. On the other hand the electromagnetic 
interaction of these leptons is given by 


Lom =e JS" A? (11.18) 
where 
J =e y€ (11.19) 


is the electromagnetic current. In a unified gauge theory of weak and 
electromagnetic interactions, we need at least three vector gauge bosons (W+ 
and the photon) to couple to the currents J, Jt, and J™. The simplest group 
with three generators is SU(2). However, as we shall demonstrate imme- 
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diately, the three currents listed in eqns (11.17) and (11.19) do not close 
(to form an algebra) under commutation. Define the weak and electric 
charges as 


T,(t)=4 | ano | aidan 
T_(t) = T(t) 
Q(t) = [axa S | d?x et e. (11.20) 


Using the canonical commutation relations for fermions 


{Wi (x, t), w(x’, t)} = 6; 6°(x — x’), 
we can show that 
[T,(¢), T-()] = 2T3(¢) (11.21) 
with 


T3(t)=4 fassia — Ys)Ve — e'(1 — ys) e]. (11.22) 


T, # Q and thus T,, Q do not form a closed algebra. The reason behind this 
is not difficult to see. In order for Q to be a generator of SU(2) the charges of 
a complete multiplet must add up to zero, corresponding to the requirement 
that the generators for SU(2) must be traceless. In the case at hand we are 
attempting to form a doublet out of v, and e which clearly do not satisfy this 
condition. Also, T, (t) are of the V-A form while Q is purely vector. 

There are two alternatives at this point. 


(i) We can introduce another gauge boson coupled to T, as given in eqn 
(11.21). These four generators can now form the group SU(2) x U(1). This is 
the choice we shall adopt eventually. 

(ii) We can add new fermions to the multiplet and thus modify the 
currents in order that the new set of T, and Q will be closed to form SU(2) 
under commutation. In our case we may attempt to form a triplet with e, v., 
and a new charged heavy lepton E*. Such a theory has in fact been 
constructed by Georgi and Glashow (1972a). (They also introduced a neutral 
heavy lepton N in order to obtain the V—A form for the weak current at low 
energies.) In this model, 


Et 
4(1 —y;)| v.cosa+Nsina f 
e 
Ee 
(1 +y N p 


e 
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and 
4(1 + y5)(N cos a — v, sin «) 


are the two triplets and one singlet in the model. This yields the weak charge 
T,(t)=4 [arse *(1 — ys)(v. cos « + N sin a) 


+ (vicos a+ Nf sin a)(1 — y) e 
+E(1 + y;)N + N'(1 + ys.) e]. (11.23) 
It is then straightforward to see that 


[T+ T-(J = 20) 
with 


OQ = | d3x[E'E — ef e]. (11.24) 


Clearly the only neutral current in this case is the electromagnetic current and 
the model was ruled out by the discovery of weak neutral-current effects in 
1973. Furthermore, it is difficult to incorporate fractionally charged quarks 
into such a model. 


The unitarity argument. Equivalently we can argue from unitarity that it is 
necessary to introduce either a new charged lepton or a new neutral gauge- 
boson. Consider the reaction vy > WtW- with both Ws being longitudin- 
ally polarized. The lowest-order amplitude is given by Fig. 11.2 


vel?) wtik.e) 


VP’) W (k'e) 


Fic. 11.2. vy > W*W7 with a t-channel exchange of leptons. 


T(vý > WWT) = —id(p’)(— ig#)(1 — y5) 


x p—-k—-m (—ige)(1 — ys)u(p) 


sei Vor | es 
= - 29769 =P — os Ys) u(p). (11.25) 


The polarization vectors é)(k) with e®-e = —6,, and k-e® =0 may be 
chosen in the rest frame of the W boson as Ẹ = 0 and e? = 6,;. To obtain e” 
for a moving W boson: k, = (E, 0, 0, k) with k = (E? — Mẹ)? we can make 
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the appropriate Lorentz boost along the z-axis. The transverse polarizations 
will not change under such a Lorentz transformation while the longitudinal 
polarization vector becomes e%) = Mw'(k, 0,0, E). In the high-energy limit 
with k = E — My/2E +... the e\?) vector can be approximated as 


eP = k,/My + O(My/E). (11.26) 
Thus, substituting (11.26) into (11.25), we have 
T, ~ —2g?/(k? — 2p -k)o(p')(K'/Mw) 
x (P — k)(k/My)(1 — y5)u(p) 


2 


2 
~ Fz PPRA — vsulp) (11.27) 


where we have used the result (p — k)ku(p) = (2p:k — k*)u(p). To show 
more explicitly that this amplitude is a pure J = 1 partial wave, we can 
choose the following momentum configurations 


P, = (E, 0, 0, E), Pp, = (E, 0, 0, — E) 


k„ = (E, ke), ki, = (E, —ke) 
with 
e = (sin 0, 0, cos 0). (11.28) 
Since v and v have opposite helicities, we have 
up) = VE( pp r= VE(, Jer (11.29) 
oP z 


i(p’) = VE tial | -1)= JExt(—6,,-1) (11.30) 


l 0 
Kj = 0) KAZ | 


and thus the combination in eqn (11.27) becomes 


öp K'A — ys)u(p) = Exia(— 1, — 1) 


[E ko-e\( 1 -1\/ 1 
alee aes Ne 


= —4EFy\ (E—ke-e)y_1. =4Eksin@. (11.31) 


where 


We have T, ~ GĻE? sin 0 as E > œ. The partial-wave expansion for the 
helicity amplitude is (Jacob and Wick 1959; see also Frazer 1966) in this 
case 


Oo 


Ti 544,4,4,(£> 9) = ` (2J + 1) Ty 424, 4ap(E)di.4(9) 


J=M 
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where A, = —A, = 1/2 and A; = A, = 0 are the helicities of the initial and 
final particles with A= A, —A, = 1, u = å} — å, = 0 and M = max(A, u) 
= 1. d},(0) is the usual rotation matrix with d1o(0) = sin 0. It is clear that T, 
corresponds to a pure J = 1 partial wave and violates the unitarity bound of 
T’~‘(E) < constant. To cancel this bad high-energy behaviour, we need 
other diagrams for this reaction. There are two possibilities: s-channel or u- 
channel exchange diagrams. (The t-channel diagram will not help as it gives 
the similar contribution as before with the same sign.) 

The heavy-lepton alternative; the u-channel exchange diagram in Fig. 
11.3(a) yields the amplitude 


Ap — k'e (1 — ys) 
T (vý > WWT) = —2g olp) ———— -> u 
( ) g (p^) (p — k’)? — m2 (p) 
meee 
a v(p')k'(1 — ys)u(p). (11.32) 
Ww 
v(p) Wt (k) v(p) W*(k) 
Z 
Et 
vp’) W(k’) v(p') W~(k’) 


(a) (b) 


Fic. 11.3. vv > W*W7 with (a) u-channel and (b) s-channel exchanges. 


If g? =g’*, this will cancel the bad high-energy behaviour given in eqn 
(11.27). 

The neutral vector boson alternative; the s-channel exchange diagram in 
Fig. 11.3(b) yields the amplitude 


TAV > W*W>) = iP AiD — VUP) Laut (EK) 
x i[—g”® + (k + kk + kYP/M2]I/Elk + kY — M2]. (11.33) 
Choosing the ZWW coupling to have the Yang-Mills structure 
Lans = —if Lk! — K)aguv — OK! + K)yJau + (2k + kyGavls (11.34) 


we get 
ee’ Luy = —if [(k’ — k) £) — 2(k' «eet, + 2(k ee] 
~ -f [(k — k’),(k k]. (11.35) 
Hence 
T, = <2 DOKA — 75)u(p. (11.36) 


If we choose ff’ = 2g’, this will also cancel the amplitude in (11.27). 
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In fact if one demands that all the amplitudes which violate unitarity be 
cancelled out, one ends up with a renormalizable Lagrangian which is the 
same as the one derived formally from the algebraic approach. (For 
details of such a construction see Llewellyn Smith 1973; Bell 1973; Cornwall, 
Levin, and Tiktopoulous 1974, 1975.) 

With the choice of the group SU(2) x U(1), it is straightforward to write 
down the gauge-invariant Lagrangian 


Lı = —4Fi FH —4G,,G" (11.37) 
where 
Fi, = 0,A, — 0,Ai, + ge*ALAS, i=1,2,3 (11.38) 
and 
G,, = „B, — 0,B, (11.39) 


are the SU(2) and U(1) gauge-field tensors, respectively. Clearly before 
spontaneous symmetry breakdown this corresponds to four massless gauge 
bosons. 


Fermions (in the one-family approximation) 


In this section we shall study the standard model with its fermionic sector 
composed of e, v, leptons and u, d quarks only. As it turns out such a 
simplified theory 1s completely self-contained. Heavier fermions such as 
u, S,c,..., etc. will be incorporated in §11.3. 

As we have already mentioned in §11.1 the basic fermion dynamic degrees 
of freedom are the two-component fields with definite helicities. More 
pertinently, since gauge interactions conserve helicity, we can and should 
make independent choices for left-handed and right-handed fermions. Thus 
the first family is composed of the following 15 two-component fermions 


W = VeL, EL, ER, UL, Up, di, dp (11.40) 
with 
e, = (1 — 75)e 
er = 4(1 + y5)e, etc. (11.41) 
The colour indices a = 1, 2, 3 on the quark fields have been suppressed. 
SU(2) x U(1) quantum number assignment. From eqns (11.5), (11.7), 


(11.20), and (11.22) we have learned that the SU (2) group is generated by the 
weak charges 


Tos | (vieL + utdi) d?x 
T- = (T,)' 


T, = § | 40 —efe, + ufu, — dfd,) d3x. (11.42) 
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From these expressions for the SU(2) generators, it is clear that 


l = z and q= H (11.43) 
eL di 


are SU(2) doublets and eg, ug, and dr are singlets. The U(1) group should be 
chosen in such a way that the electric charge Q 


Q = [e + ĝuu — 4d'd) d?°x 


2 1 
= [ee — eker + ulu, + ukur = ġdid, 


— 4 dkdp) d?°x (11.44) 


can be a linear combination of the U(1) generator and T} of SU(2) in eqn 
(11.42). We observe that the combination 


Q- T,= |r- 4(viver + ele.) + (uju, + didi) 
Tt 


— eber + ulur — 4did,] d?x (11.45) 


has the property of giving the same quantum number to all members of an 
SU(2) doublet in (11.43). Clearly it commutes with all the SU(2) generators, 
l.e. 


[Q — T3, T;]=0, i= 1,2,3. (11.46) 
We then choose 
Y = 2(0 — T3) (11.47) 


as the generator of the U(1) group and refer to Y as the weak hypercharge. 
Unlike the T;s, Y does not obey any nonlinear commutation relations. Its 
scale and hence the proportional constant between it and (Q — T3) is strictly 
a convention. To obtain the correct electric charges for particles we must use 
eqns (11.45) and (11.47) and make the assignments 


Y(4)= —1, Y(er) = —2, 
Y(q,) = 1/3, Y(ur)= 4/3, Y(drk)= —2/3. (11.48) 


These hypercharge values may be remembered as twice the average charges 
of each multiplet, as the average T, value is always zero. We should note that 
the group structure allows for any hypercharge assignment. Thus ‘charge 
quantization’ (i.e. particle electric charges are integral multiples of some 
basic unit) is not automatic in the SU(2), x U(1), theory. This can be 
obtained only if the gauge group is semi-simple. We shall see in Chapter 14 
that this feature will be obtained when we combine this electroweak theory 
with SU(3) quantum chromodynamics into a ‘grand unified gauge 
theory’. 
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Anomaly cancellation. As we have discussed in §§6.2 and 9.3, the ABJ 
anomaly spoils the renormalizability of a gauge theory (Gross and Jackiw 
1972; Georgi and Glashow 1972b; Bouchiat, Iliopoulos, and Meyer 1972). 
The fermionic gauge couplings must not introduce anomalous Ward 
identities. Thus, for the fermion representation R with representation 
matrix T°(R), the trace tr({7*(R), T°(R)}7°(R)) in eqn (6.60) must vanish. 

In the present case, the fermions are either doublets or singlets under 
SU(2). The matrix T° will be either the Pauli matrix t° or the U(1) 
hypercharge Y. Since the group SU(2) is anomaly-free 


tr({t’, tbt") = 2 ô tr(1*) = 0, (11.49) 
we will consider cases where at least one of the Ts is the hypercharge Y. 
Because every member of a given SU(2) multiplet has the same hypercharge 
value, for the case of two Ts being a Y we have, 
tr(t YY) ctrt' = 0, 
and for the case of one T being a Y we have, 
tr({tf, ti} Y) =2 6" tr Y. (11.49a) 


Thus this anomaly contribution is proportional to the trace of Y (the sum 
of all fermionic hypercharge values) 


re) Y= >. YEZ E 


lepton quark 


But this vanishes by explicit calculation for the fermion assignments in each 
generation 


$ Y=-1x2-2=-4 


lepton 


y Ve3¢ x2 43) =4. (11.49b) 


quark 


The factor 3 in front of the parenthesis is due to the colour degree of freedom. 
For the case when all 7s are the hypercharge, we have from eqn (11.47), 


tr(YYY) = 8 tr(Q? — 307T, + 30T3 — T3) 
oc tr(O?T; — OT?) (11.50a) 
because tr 73 =0, and because we can ignore the Q? term as the 
electromagnetic current is a vector (V) and the VVV type of triangular 


fermion loops does not have anomaly. Explicit calculation of the right-hand 
side of eqn (11.50a) yields 


2 (QT, — QT3) = -4 41- = 


lepton 


È (ZT: — OT3) = @-$ -—F4+) =F. (11.50b) 


quark 


Thus the anomalies again cancel as in eqn (11.49) and we can conclude that 
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with the fermion assignments of eqns (11.43) and (11.48) the SU(2) x U(1) 
theory is free of ABJ anomaly. 

In fact there is a convenient way to remember the cancellation displayed by 
eqns (11.49) and (11.50). Using eqn (11.47) we can express eqns (11.49a) and 
(11.50a) as 


trYoatrQ 
tr(Q?T; — OT3) œ tr(T3Q Y) œ tr(T2Y) œ tr Q (11.51) 


because tr(7,;Q7) = tr(T3 Y°) = tr(T3) = 0. Thus the nontrivial contribution 
to the ABJ anomaly in SU(2) x U(1) theory is proportional to 


rQo=}Q0=0 (11.52) 


Lepton and quark charges cancel when the three colours are taken into 
account. [Remark: Given (11.52) one can easily show that the entire SU(3), 
x SU(2), x U(1)y standard model is free of anomaly. Namely, the ad- 
ditional triangle diagram contributions involving gluons and electroweak 
gauge bosons also mutually cancel. | 

With the fermions given in (11.40) the gauge-invariant Lagrangian takes 
on the form 


L=L£,4+ L, 
where 
L, = fiy" D,Y (11.53) 
with the covariant derivative 
D,Y = Q — igT : A, — ig’ i B, (11.54) 
For example, 
D, |, = (a, -if tA, + 2 B, Jk (11.55) 
and 
Der = (0, + 19'B, er: (11.56) 


We should note in particular that there are no gauge-invariant terms in Z 
that are bilinear in the fermion fields. Hence there are no SU(2) x U(1) 
symmetric fermion mass terms. 


Symmetry breaking via the Higgs mechanism 


We need to introduce a set of scalar fields ® and this set develops a U (1)em 
symmetric vacuum expectation value (®), so that we have the following 
pattern of symmetry breaking 


SU(2), x Uy > U Den 
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Three of the original four SU(2) x U(1) gauge bosons will become massive 
and one, corresponding to the photons, remains massless. 

As we have seen in §8.3, a complex scalar doublet breaks the SU(2) 
symmetry completely and one member of this doublet must be neutral in 
order to have possibility of a U(1).,-invariant <®> 9. We need a com- 
plex doublet with the charge assignment 


® = be} ¥(®) = 1. (11.57) 


The complete gauge-invariant Lagrangian including the scalar fields is 
then 


with 
L, = (D,®)'(D"®) — V (0) (11.59) 
where 
1 1 
D,® = (a, — zI’ i A, = 59'B, )® (1 1.60) 
and 
V(®) = -ROD + ADD). (11.61) 


The most general SU(2) x U(1) Yukawa coupling between scalars and 
fermions is given by 


G, = fl, Deg + fG Bug + f'%G, Oda + h.c. (11.62) 
with the isodoublet 
® = it Ọ* Š 


having hypercharge Y(®) = —1. 

As we have already discussed in §§5.3 and 8.3, for positive values of u? and 
à in eqn (11.61), we have spontaneous symmetry breakdown as the scalar 
develops VEV 


0 , 
KD) = <00> = ; with v = (u?/A)?. (11.63) 
2 
Particle spectra and interactions in the unitary gauge 


Using the polar variables for the scalar fields 


OH) =U Ol acy 


Z 


with 
U(C) = expliğ(x): t/v]. (11.64) 
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Thus the original two complex fields @*(x) and $°(x) in (11.57) are 
parametrized in terms of four real fields ¢;(x) and n(x). These shifted fields 
have zero VEV 


CO[C:10> = <O|7|0> = 0. (11.65) 


In order to display the particle spectra, we then make a gauge transforma- 
tion, i.e., go to the unitary gauge, by defining new fields. [Remark: We have 
simplified the expression in (11.64). In principle ¢; should be multiplied by 
all the broken symmetry generators, as in (8.127). For the case at hand, 
however, the difference is immaterial. ] 


a 7 0 
pag ere n(x) 
J2 
or 
4 . 0 
vo H, win =à) (11.66) 
and 


L = Ul, ek= er 


qı = U(O)g., Ur = Uur, dr = dr 
tT A 
2 


:) U~ 1O — igl VOIL T O 


B, =B 


u’ 


(11.67) 


We next express each Y, of the Lagrangian (11.58) in terms of the new fields. 
Consider first those containing ®s and use eqn (11.66) 


Fz = (D,,®’)'(D“®") — V(O’) (11.68) 
with 
Gay AG o.\) UAE NO) 
D,® = (0, -if0-a, if) J2 b (11.69) 
1 2.2 3 A 4 
V(@') = pn? + Avon +7" (11.70) 
and 
n(x) 


L4 = 2 [SC eer + Vuk +f ddr] 


+ me [fee + fauk +f ddk] +h.c. (11.71) 
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Mass spectrum. From the above equations one can easily read off the mass 
terms which are bilinear in the fields: 


(i) Scalar mass: (the physical Higgs particle) 


m, = /2 Uw. (11.72) 
(ii) Fermion masses: 
m, = f®v/ y2, 
m, =f®v/ v2, mg = fv/ 2. (11.73) 


(iii) Vector meson masses: The three ‘would-be-Goldstone bosons’ €(x) 
are transformed away in eqn (11.66). (We say they are ‘eaten’ by the gauge 
bosons to form three massive IVBs.) The vector meson masses are contained 
in the (D,®’)” term in 3. 


o AI a er ee I 
Samt (eae) eara X 


2 
= = {PEA +4] + (94,2 — 9'B,)"} 


= MiWi W~" +4M2Z,Z". (11.74) 


We need to make the following identifications of mass and charge 
eigenstates. For the charged vector mesons, 


2,2 
MaW W= o [(411) + (4127). (11.75) 
Thus, 
W+ = (A) F iA'2)/ V2 (11.76) 
and 
M3, = gv? /4. (11.77) 


For the neutral vector mesons, because U(1),,, is unbroken Q¢®), = 0, the 
associated gauge boson (the photon) will remain massless, we have 


l v? 4 1 4 
~ M3Z,Z" = A (gA? — g'B',)’ 


2 
v? g —ggN (4 
dati A. B' 
g ( u TE g? B" 


l Mz 0\/Z" 
= 3 Zo A i oae) (11.78) 


We have diagonalized the mass matrix by an orthogonal transformation 
Z,, = cos ĵĝwA; — sin Ow Bi, 


A, = sin OwA; + cos OwB’, (11.79) 
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with 
tan Ow = g'/g (11.80) 

and 
M3 = v*(g* +.9'7)/4. (11.81) 


The angle of rotation Ôw is generally referred to as the Weinberg angle. Using 
eqns (11.27), (11.80), and (11.81) we also have the ratio between vector 
meson masses 


p = M2,/(M3 cos? Oy) = 1 (11.82) 


Doublet Higgs and p = 1. The derivation of (11.82) is predicated on our use 
of the Higgs scalar in the doublet representation. To see this we retrace some 
of the steps in its derivation. Gauge invariance requires that the scalar 
potential in eqn (11.61) is a function of |®|?, which may be viewed as the 
length of a four-vector made up of the four real components of ®. Thus 
V(®) = V(|®|7) has a larger O(4) x SU(2), x SU(2)’ symmetry and an 
O(3) x SU(2) symmetry after SSB (11.63). We have already encountered 
such a symmetry-breaking pattern back in §5.3 with the SU(2) x SU(2) o- 
model. The multiplet (c, n) transforms as a (5,4) under SU(2) x SU(2) and 
as a four-vector under O(4) symmetry. The true vacuum singles out a 
direction (chosen to be o in that case) and the theory breaks down to an O(3) 
symmetry with z being still a degenerate triplet. In the case here A, remains 
to be a degenerate triplet under the remaining O(3) symmetry. Thus we have 
the crucial equality for the three terms that are bilinear in A, fields (11.74) 


2 


Once given that A} also has the mass term 4M į, the form for the neutral IVB 
mass matrix in (11.78) is then completely fixed by the trace (= M 2/2) and the 
determinant (=0) conditions 


2 2 ag2 1/2 3 
me E My (M2 — M2)? M2— M, B 


u 


$ DzoA) = L MRAN? + (42)? + (432), (11.83) 


The mass relation of eqn (11.82) follows immediately. 


Charged current. In order to identify the currents with those of V-A 
theory, let us next examine the fermion gauge interactions as contained in Z, 
of eqns (11.53) and (11.54) 


Ti g / g' , 1 pi à g 1 g' / / 
n (fx = ok + afge +78 Jai 
= 1 Lat = 2g' faa? ’ g' / 1 
= ERg'B'eR F Og Bu == dB dp 
=g ality lt + 34177 GLA” 
+ 39(—lyal + 371V qL — 2ERyyeR + für Ur — 3dRy,dR)B” 


= (gJ1A™ + gJ2A'?) + (gJ3A'%™ + 4g YB") (11.85) 


corresponding to charged- and neutral-current interactions. 
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For the charged current interaction in (11.85), we have 
Loo = (gS, A + GIZA) 


ee 
p 
with W7 given by eqn (11.76) and 


(Jt wre +J wW”) (11.86) 


Jį =J} +iJ? = viy,er + thy dy 
= 3V'y,(1 — y5)e' + 207, (1 — y5)d’. (11.87) 

For the low-energy four-fermion interaction shown in Fig. 11.1 we can 
generate the following effective Lagrangian 
~ 2M2, 
This is just the V-A theory. When eqn (11.88) is compared to eqns (11.3), 
(11.5), and (11.87), we can make the identification 

Ge g’ 

J2 8M3, 
Using eqn (11.77), this implies that the vacuum expectation value in (11.63) 
has the size 


cc 
L of 


he (11.88) 


(11.89) 


v = 27 1/4G7 12 ~ 250 GeV. (11.90) 


Neutral currents. For the neutral-current interaction in (11.85), we have 
Lye = JJ} A”! + 4g'J] B" 
= gJ} (cos OywZ" + sin OWA") 
+ tan Owg( Ji" — J})(cos OwA” — sin Oy Z") (11.91) 
where we have used eqns (11.79), (11.80), and (11.47). Thus 


Lyc = eJI A" + (g/cos Ow) J, Z" (11.92) 

with 
e = g sin by (11.93) 
J? = J? — sin? Oy J™. (11.94) 


The neutral currents can be written out explicitly in terms of the fermion 
fields 


Ja = yy Lol fast “ie GARY AR 
f 
l 5 z 
25 X CA — yS + IRJ + ys)f]  (11.95a) 


where f = v., e, u, and d. The weak neutral-current couplings are 


gi r= T3(ft.r) — Q) sin? Oy. (11.95b) 
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From egns (11.42) and (11.44) we then have 


I. = 3; IR = 0 
e 1 Som 2. Maced 
IL Se + sin Ow, IR = Sin Ow 
gi = 4 = å sin? Ow, IR = —2 sin? Ow 
gi = —it+tsin? 0w, gi =4sin? Oy. (11.95c) 


Starting from the coupling (11.92) we can then generate low-energy four- 
fermion interactions corresponding to the products of neutral currents 


2 
=g 
ye = J2 Jor 
" 2 cos? OyM2° " 

2M3, 

where eqn (11.82) has been used. The factor of 2 in the denominator comes 

from the symmetry factor in Feynman rule for two identical currents in 

(11.96). Thus the SU(2) x U(1) theory predicts a set of new weak interactions 

which are of comparable strength to the more familiar Fermi interactions of 

charged currents. As an illustration, consider the v,e elastic scattering; both 

charged and neutral currents contribute. The low-energy amplitude (given by 
Fig. 11.4) is 


Jor (11.96) 


2 


T(v.e T? V€) = ae {Lv.y,C1 > ysje]ley"( = Ys)Ve] 


—5[¥.y,(1 — ys)velLey"(1 — ys)e — 4 sin? Oweyte]}. (11.97) 


Fic. 11.4. Charged- and neutral-current contributions to elastic v,e scattering. 


After a simple Fierz rearrangement (see Appendix A) these two con- 
tributions can be combined, 


Gr z 
T(vee > ve) = Gy [Vall — ys\velLey(@ — bys)e] 
where 
a=2sin*O,+5 and b=4. (11.98) 


Of course the old V—A theory would have predicted a = b = 1. 

As we shall discuss in the next chapter, such a class of neutral-current 
processes has been discovered in high-energy experiments. All data are 
consistent with a value for the Weinberg’s angle in the neighbourhood of (for 
a review see Kim, Langacker, Levine, and Williams 1981) 


sin? Oy ~ 0.22 (11.99) 
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Discussions. The next major step in testing the standard theory will clearly 
be the discovery of charged and neutral gauge bosons having the prescribed 
mass values and couplings. From eqns (11.89) and (11.93) we have 


My = 27 °**eG; t? /sin Oy 
~ 37 GeV/sin Ow. (11.100) 


With the value of Ow given in eqn (11.99) we anticipate My œ 80 GeV and, 
with eqn (11.82), Mz = 90 GeV. Another crucial test will be the discovery of 
the Higgs particle 7. However the numerical value of m, as given in eqn 
(11.72) is not fixed by any previously measured quantities. This makes a 
search for the n difficult, particularly in view of its very weak couplings to 
fermions, being of the order of (m;,/My)e, as given by eqn (11.71). 

In fact the standard theory, even with only one family of fermions, has 
seven arbitrary parameters e, sin Ow, My, My, Me, Mu, Mg; Or in terms of the 
original symmetric Lagrangian the seven arbitrary parameters are the two 
gauge couplings g and g’, the two scalar self-couplings u? and 4, and the three 
Yukawa couplings: f©®, f", and f®. We note that the electroweak 
unification is in a sense not complete: we need to insert two gauge coupling 

-constants g and g’ to account for these two classes of interactions. The quest 
for further unification will be discussed in Chapter 14. 

Finally we emphasize that the guiding principle in constructing gauge 
models is gauge symmetry and renormalizability. We must include in the 
Lagrangian all gauge-invariant terms of dimension-four or less. Terms may 
be excluded without destroying renormalizability only with the imposition of 
the appropriate global symmetries. In the case of the standard electroweak 
theory no ad hoc global symmetry has been imposed. The conservation laws 
the theory possesses, such as those of baryon number and lepton number, are 
consequences of gauge invariance and renormalizability once the repre- 
sentation content of fermion and scalar particles are given. Thus both 
leptons and quarks in each fermion family must be present in order that 
renormalizability is not spoiled by the ABJ anomalies. On the other hand 
cross-couplings between leptons and quarks are absent in eqn (11.62) 
because they are forbidden by gauge symmetry. For instance, if there were a 
set of scalars transforming as a doublet under the SU(2) symmetry and as a 
triplet under SU(3).our: ht GG = 1,2; « =1,2,3), then we would have 
the gauge-invariant Yukawa couplings |, hig% and G,, 34,€"" corresponding 
to baryon- and lepton-number nonconserving terms. Thus the structure of 
the Weinberg—Salam theory automatically leads to B and L conservations. 
[Here we ignore the unobservably small instanton effects (t Hooft 1976). ] 
Lepton number violation by Majorana neutrino mass terms will be studied in 
§13.2. Baryon- and lepton-number conservation laws are both violated in 
grand unified gauge theories (see Chapter 14). 


11.3 Fermion family replication 


In $11.2 we have shown how to construct the basic version of the standard 
electroweak gauge theory with a restricted fermion content: the 15 two- 
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component fields of eqn (11.40). The left-hand (LH) fields transform as 
doublets and the right-hand (RH) fields are singlets. Now we would like to 
remove this restriction. Eqns (11.5) and (11.9) would lead us to expect further 


LH fermion doublets 
a (<) (11.101) 
do/t Se/L 


KEN 


dą = cos d + sin 0,s 


with 


Sy = cos 6.s — sin 0d, (11.102) 


again with all RH fields being singlets. In other words there is a ‘duplication’ 
of the fermionic structure in the theory. With the discovery of t, v, leptons 
(Perl et al. 1975, 1977) and the b quark (Herb et al. 1977; Lederman 1978) we 
see that this replication continues: the experimental data do not contradict 
the expectation that (v,, tT), is an SU(2) doublet and that there exists an even 
heavier t quark which will complete the doublet with b. The task of this 
section is to study the systematic incorporation of these fermions into the 
standard model. We would like to understand the presence of the Cabibbo 
angle in quark doublets and the absence of a corresponding mixing angle in 
the lepton sector. Other important issues related to the multifamilies of 
fermions are the suppression of flavour-changing neutral-current effects and 
the CP-violation phases in fermion gauge couplings (also see §12.2). 


Gauge vs. mass eigenstates 


The presence of the Cabibbo angle in (11.102) already indicates that we must 
distinguish between two types of fermionic states: gauge interaction eigen- 
states (having definite gauge transformation properties, e.g. the d, and s, 
fields) and mass eigenstates (e.g. the d and s fields); they are related by some 
linear transformations. This is because the fermions are massless before the 
spontaneous symmetry breaking and the fermion mass eigenstates are 
determined by the Yukawa coupling after the spontaneous symmetry 
breaking (SSB). The replication of fermionic structure alluded to in the 
introductory remarks means that there are several groups of identical gauge 
eigenstates. Thus the fermionic states actually carry a ‘family’ index or 
‘generation’ index: A =e, p, T,.... In §11.2 we simplified our construction 
by taking only one family of fermions. In this approximation there is no 
distinction between gauge eigenstates and mass eigenstates. Since only the 
gauge transformation properties were used, the fermion fields in §11.2 should 
be regarded as those of the gauge eigenstates. The proper multi-family 
generalization would then involve replacing each of these gauge eigenstates in 
§11.2 by a vector in the family index space. We need to make the following 
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substitution: 

e => e4 = (e, p’, T’) 

Ve > V4 = (Ves Vas Ve) 

u > p4 = (V, c’, t’) 

d > n; = (d’, s’, b^). (11.103) 
The SU(2) doublets are then 

l= e and dy = (?*) | (11.104) 
C4/L Nn4/L 


The prime on the field indicates it is a gauge eigenstate. (This represents a 
change of notation from that in §11.2 where the primed states are fields in the 
unitary gauge.) The fermionic couplings in the Lagrangian also involve a 
contraction of family indices. The generalizations of eqns (11.53) and (11.62) 
are 


, i ig' , , 
Ly = ly) z- ae “K+ T a)i T E'ri( + ig’ B) esr 
E ig ig’ 
F aai( Z- p K — © p)au 


=7/ ? 2i i 1 =f. 28 l 1 
F pai( 7- = #) Par + Mai( 7+ = Prin (11.105) 
and 
LPa = f Shla® epr +f Bd 4L OP pp 
+ f G4. Onze + hic. (11.106) 
Thus the Yukawa coupling constants in eqn (11.62) are replaced by coupling 
matrices in the family index space. After SSB, given in (11.63), we have 
x e a / oat / nN) / 
Fi = [fe Gar ear +f iDa Por +S TRL npr] 
id (e) xr a (Pp) ar wy (n) or p 
T V2 L faze step + J žb PaL Par + fae Ngee] + h.c. 
(11.107) 
which is a generalization of (11.71). Thus in the gauge-eigenstate basis the 


fermionic mass matrices are 


M®, = ——f®  i=e, p,n. (11.108) 


Biunitary transformations. The important point is that there is no reason for 
these mass matrices to be diagonal; in fact generally they are neither 
symmetric nor hermitian. We shall now demonstrate that this type of 
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matrices can be diagonalized by biunitary transformations, i.e. given My, 
there exist unitary matrices S and T such that 


S'MT = M; (11.109) 


where M, is diagonal with positive eigenvalues. Basically the points are that 
any matrix M can always be written as the product of a hermitian matrix (H) 
and a unitary matrix (V) 

M = HV (11.110) 


and the hermitian H can then be diagonalized by some unitary matrix. 
The proof proceeds as follows. MM" is hermitian and positive; it can be 
diagonalized by an unitary matrix S$ 


S'(MM')S = M3 (11.111) 
with 


The matrix S is unique up to a diagonal phase matrix; 1.e. if eqn (11.111) 
holds, then 

(SF)'(MM')(SF) = Mj 
with 


F= e's | (11.112) 


These phase degrees of freedom will be studied in more detail when we take 
up the question of CP-violating gauge couplings. Here they can be used to 
ensure that all eigenvalues of M; in (11.109) are positive 


Mı 
M; = m, m; > 0. (11.113) 
M3 
Define a hermitian matrix H by 
H = SMS’. (11.114) 
Then we can show that V defined by 
V=H'M and V'=M'H"! (11.115) 


is a unitary matrix because of eqns (11.111) and (11.114) 
VV’ = H-'1MM'H™! 
= H™'SM?S'H~! 
= H~'(SM,S"')\(SM,S")H ~! 
= H~'HHH™' = 1. (11.116) 
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From the definitions of H and V (eqns (11.114) and (11.115)) we have 
S'HS = S'MV'S = Ma (11.117) 
or 
S'MT = M, (11.118) 


where T = V'S is also unitary; this is the promised result. Thus the relation 
between gauge eigenstates and mass eigenstates follows 


ViMwe = (WiLS)(S'MT)(T'We) 


= WiMaWr (11.119) 
with 
Y = SY 
R= Twp. (11.120) 


Mixing matrix in the quark charged current couplings 


We now apply this result to the charged weak current for quarks as derived 
from eqn (11.105) or from eqn (11.87) by the substitution of eqn (11.103) 


Jan = GarYpt Gar 


= PALY NAL 
= PALY uS ip S]a (11.121) 
where 
pi = SPL 
ni = Swn- (11.122) 


Thus, in terms of the mass eigenstates, the quark doublets of the three 


families are 
u C t 
; 11.123 
qaL a e a ( ) 


where 
d” d 
s” }=UI s (11.124) 
b” b 


with U = StS. Clearly U is also a unitary matrix. 


CP violation phases. If the mass matrices were real, U would be an 
orthogonal matrix, and all fermion gauge couplings would be real and they 
could not induce CP violations. On the other hand a general 3 x 3 unitary 
matrix, can be parameterized by the three real rotational angles plus six 
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complex phases. Not all complex phases have physical meaning however, as 
some of them can be removed by a redefinition of quark fields (except for the 
charged current all other terms in the Lagrangian are diagonal in quark 
flavours). At first sight one would think that six quark flavours can absorb six 
phases. (Keep in mind that q, and qr must be rotated by the same phase to 
keep the masses real.) This is actually not correct as the mixing matrix U is 
invariant when we change all quarks by the same phase. Thus only five 
phases can be removed by redefinitions. To see this explicitly, let us start with 
the first doublet in (11.123) 


u 
= ; 11.125 
dil T JUSt m ( ) 


If U,, has phase 0, i.e. 


Uà = R,,e° Rii real, (11.126) 
then this ô can be pulled out by a redefinition of the u-quark field 
u > u' = ue” (11.127) 
and 
= e” = , 11.128 
we bas PUSA = ee 


Similarly, we can factor out the complex phases of U,, and U}, by a 
redefinition of the c and t quark fields in eqn (11.123). These overall doublet 
phases are immaterial because there are no gauge couplings in eqn (11.105) 
between doublets with different family indices. Finally we can absorb 
two more phases of U,, and U,, by a redefinition of the s and b fields. The 
doublets of (11.123) take on the form 


u’ c’ 
a + Ris’ + | T + Rets’ + TE 


t x 
| 11.1 
T + R32€'3s' + elie (11.129) 


We have now reduced the number of parameters to 13, (9 R;;s and 4 ô;s). The 
normalization of each state gives three conditions and the orthogonality 
conditions among the different states yield six conditions. Therefore we are 
left with four independent parameters. Since we need three parameters for 
the real 3 x 3 orthogonal matrix, we end up with one independent phase. 

We shall summarize the above counting procedure by applying it to the 
general case of n-doublets: A complex n x n matrix has 2n? real parameters 
which are reduced down to n? when the unitary condition is imposed. (2n - 1) 
phases can be removed by redefinitions of quark states. Keeping in mind that 
there are n(n - 1)/2 parameters (angles) in an n x n orthogonal matrix, we 
arrive at the number of independent physical phases 


n? — (2n — 1) — n(n — 1)/2 = (n — 1)(n — 2)/2. (11.130) 
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Thus for a two-family theory there is one mixing angle—the Cabbibo 
angle—and no phase. Only for a theory with 2 x 3 = 6 quark flavours can 
one obtain a nontrivial CP-violating phase in the quark gauge couplings. 
This observation was first made by Kobayashi and Maskawa (1973) and the 
unitary matrix (11.124) is often referred to as the KM matrix. The 
phenomenology of CP violation in gauge theories will be discussed in §12.2. 


Neutrino mass degeneracy and the absence of leptonic mixings 


We have seen that the standard theory gives a natural explanation for the 
presence of the Cabbibo angle and CP phases in quark charged currents. 
Similarly the same theory helps us to understand the absence of such features 
in the lepton sector; the masslessness of neutrinos implies that these mixings 
are physically unobservable. This can be seen as follows. Just as in eqn 
(11.121) we can write down the charged weak current for leptons 


Jin = layat lay 


= VALY „EAL 
= Va [St Selanne (11.131) 
where 
VE = Sw Vi 
eL = Sweet (11.132) 


and the lepton doublets in terms of mass eigenstates should be 


«(OLE 
^ eji wh th (11.133) 


where 
Ve Ve 
v J=VI yu (11.134) 
vi V: 


with V = Si So. However, neutrinos are massless, hence degenerate. Any 
unitary transformed v-states can be taken as mass eigenstates; in particular 
VaL = Wat Or V may be'set to be the identity matrix. Thus, nontrivial lepton 
mixing angles can never show up in any physical processes if the vs are 
degenerate. In this connection we should remark that the conventionally 
referred to neutrino states v, are really weak-interaction (gauge) eigenstates. 
The v, is operationally defined to be the ‘invisible’ particle missing in the 
nm > l decay and/or detected by /-production in v-matter scattering. With 
V = 1, different lepton families are completely decoupled. 


Remark: Here we assume that there is only one set of Higgs doublet. 
Thus the Yukawa couplings are directly proportional to the fermion mass 
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matrix. Mass matrix diagonalization then also diagonalizes the Yukawa 
couplings. With more doublets the Yukawa couplings are usually flavour- 
changing (Bjorken and Weinberg 1977). However crude estimates would 
indicate that such a flavour-changing mechanism brings generally too large a 
rate for strangeness-changing neutral-current and muon-number violation 
effects. 

The standard theory (with one doublet of Higgs scalar) possesses global 
symmetries corresponding to the separate conservation of e-, u-, and t-lepton 
numbers. Processes such as p > ey, K? — pe, etc. are forbidden to all orders. 
Again we emphasize that these features are not ad hoc inputs but are natural 
consequences of the theory when neutrinos are degenerate. The possibility 
that vs may have small masses will be explored in §13.2. Even for massive 
neutrinos we would expect that the nontrivial lepton mixing angles would 
always appear multiplicatively with neutrino mass differences. Hence, muon- 
number nonconservation effects are invariably suppressed. These remarks 
will be illustrated with explicit calculations in Chapter 13. 


Flavour conservation in neutral current interactions 


The presence of the unitary transformation (11.124) means that hadronic 
charged currents are flavour nondiagonal and strangeness is not conserved in 
charged-current weak interactions. On the other hand, consistent with 
longstanding experimental observations, (11.124) does not bring about 
flavour-changing neutral currents. When we change in eqn (11.95) 


Ja GLY PiarMePar + GRY, ParYuP ar 
A A 
+ gi yA Ny YD + gk 5 NyRYuNAR (11.135a) 
A Á 


the gauge eigenstate bases to mass eigenstate bases through (11.122), the 
unitarity condition means that we can simply remove the primes on the 
above equation to obtain 


Ja = L (AA Vuh + Ih Ryu SR) (11.135b) 


for f = Ve, Vus Vz, €, H, T, U, C, t, d, s, b, .... This is possible because there is 
fermion family replication, in the sense that fermions with the same charge 
and helicity have the same gauge group transformation property. In fact this 
was the original motivation which led Glashow, Iliopoulos, and Maiani 
(GIM) to introduce a charmed quark having the same charge and weak 
couplings as the up-quark. However the GIM mechanism means much more 
than the mere absence of flavour-changing neutral-current coupling in the 
Born approximation. Since the weak gauge coupling g is of order e (eqn 
(11.93)), one must be watchful that higher-order effects do not give rise to 
O(G,a) amplitudes for strangeness-changing neutral-current processes such 
as Kı > pp". (Experimentally they are observed to be at most O(G?).) We 
shall study the GIM cancellation mechanism in the next chapter. It suffices to 
point out that the same relationship between mixing angles and fermion mass 


11.3 Fermion family replication 363 


difference mentioned above in connection with muon-number nonconserv- 
ation applies here: the induced amplitudes as expected, are of the magnitude 
œ g’Gp Am2/M¥, ~ G Am? , where Am, is the quark mass difference and we 
have used eqn (11.89). 


Summary 


Let us summarize the discussion on the fermion sector of the standard 
model. The fermion gauge eigenstates display a repetitive structure and they 
can be grouped into families. Each family consists of 15 two-component 
fields of leptons and quarks, with four LH doublets and seven RH singlets. 
The ABJ anomalies due to the lepton and quark loops precisely cancel. Thus 
the standard model with one family would be a completely self-consistent 
theory. With more than one family the principal new physical feature is the 
presence of mixing angles and complex phases in fermionic gauge couplings. 
This is because gauge invariance allows for nondiagonal mass matrices for 
each set of the same charged fermions. The mass eigenstates are introduced 
when we diagonalize these mass matrices. They are related to gauge 
eigenstates by unitary transformations. The procedure is irrelevant for 
leptons because of neutrino mass degeneracy. Thus in the standard theory 
with three families there are three mixing angles and one CP violation phase 
in the quark gauge couplings. Altogether we need to insert 17 parameters in 
such a theory (see the discussion at the end of §11.2). At present there is no 
deep understanding of this repetitive fermion structure. (The existence of 
family symmetry group? Lepton and quark substructure?) This is usually 
referred to as the fermion family problem. 


12 Standard electroweak theory II: 
phenomenological implications 


12.1 Flavour-conserving neutral-current processes 


A basic requirement for any gauge model of electroweak interaction is that it 
should contain the well-established V-A charged weak currents in the low- 
energy limit. But the Glashow—Weinberg—Salam model also predicts a set of 
new low-energy phenomena associated with the neutral weak currents. Their 
experimental discovery and the subsequent confirmations in detail of the 
predicted structure have brought about the general acceptance of this model 
as the standard theory of the electroweak interaction. In this section we shall 
discuss processes corresponding to the tree-diagram neutral currents, which 
are diagonal in flavour space (see eqn (11.135)). Flavour-changing neutral- 
current effects as induced by higher-order loop diagrams will be studied, 
along with CP violation, in the next section. 

For momentum transfer much less than the vector gauge boson masses, 
the effective Lagrangian for weak-current interaction may be written 


l l 
g Len = Ler + Lor 
_ Suir + pJ? J) (12.1) 


where Ji, and J? are given in eqns (11.87) and (11.95) respectively, and p 
measures the relative strength of charged-current (CC) and neutral-current 
(NC) processes 


Mw 


~ M2 cos? by oe 


p 


For the standard theory with doublet Higgs phenomenon only (eqn (11.82)) 
p is fixed at 

p=. (12.3) 
The NC part in (12.1) reflects the basic coupling (eqn (11.92)) 


gre 9 _ zeuyo 12. 
cos by j Ces) 


with the neutral current being diagonal in quark—lepton flavours f = l, v,, u, 
d,s,c,... 


Ja = HSI fi + IRI RY fr 
= sof fy — ys) f+ oh fv + 95)f] (12.5) 
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where 
gl = Tfi) — Q(f) sin? bw (12.6) 
Gx = T3(fe) — QC) sin? bw (12.7) 


where T, and Q are the third components of weak isospin and electric charge, 
respectively. We have deliberately parametrized the NC pieces in a suf- 
ficiently general form to include all models based on the group SU(2) x U(1), 
i.e. models having different fermion content and representation assignments. 
The standard model values gf , for the first fermion family is given in eqn 
(11.95). Universality (and fermion replication) implies that, for i = L, R, we 
have gf = gi = gi, gi = gi = g}, gf = Gi = g?, and of course, all neutrinos 
have gj = 1/2. Thus in the standard model prediction for neutral currents 
there is only one unknown parameter: the Weinberg angle Ow which reflects 
the relative coupling strength of the SU(2) and U(1) gauge group factors. 

The neutral currents were first discovered by the Gargamelle collaboration 
at CERN (Hasert et al. 1973). These and most of the subsequent data are on 
inclusive neutrino scattering off an (isoscalar) hadronic target. However we 
choose here to discuss first the theoretically simpler situation of leptonic 
neutral-current processes. 


V, + e > V, + e and V, +e >V +e 


These two processes are purely neutral-current processes, described by 
diagrams in Fig. 12.1. The amplitude for v, + e > v, + e is 


2 
g 7 
DS eal a Jae aye! a k 


x {ai(p’, ADEE” — ys) + grv“(1 + ys)] ul, A}. 


(12.8) 
vu (k) vu(k’) Vu Vu 
Z Z 
e(p) e(p’) e e 


Fic. 12.1. 


Summing over the final spin and averaging over initial electron spin (v, is 
always left-handed), 


LY ITA, 4)? = (SE) tk KEEL- p, p') 
2 a ASt = J2 uy \Y’ L , 


+ (gR L**(p, p')] (12.9) 


where 
Lik, k’) = tr[k’y, (1 a ys)ky,(1 + Ys)] 
= 8(k,k, + k'k, — KK Gyy F i€augsk’ k’). (12.10) 
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We have neglected lepton masses; thus 
(k:p)=(k'-p’) and (k':p)=(k-p’). (12.11) 
The leptonic tensor products in (12.9) are 
Link, k)\L**"(p, p') = 128{[(k p) + (k: p)] 
+ [(k -p — (k’-p)*}} (12.12) 
and 


5 DITA = (166.9. POG)? + (APK DPD. (1213) 


The cross-section is given i 


_ l l 4 s4 ! , l 2 
o- Amt ork +p —k p51 
370 Sash 
ee (12.14) 
(27)°2ko (22)? 2p 
The phase space integral can be calculated easily 

dk’ d3p’ 1 dE 
4 54 EN ae , Srne ee A I 12.15 
jen ERE P) OIE On 2b. 8n E “> 


where F£,(£,,) and Æ, are the initial (final) neutrino and final electron energies 
in the laboratory reference frame. Then we have 


dø(v,e) _ l x 
dE,  32nm zrl 2 IT; 


—_ 


—— (IV L(g)” + (Ge)? (E/E. (12.16) 


In terms of the conventional scaling variable (of fractional energy transfer), 


y= E/E, (12.17) 
we have 
do(v,e) 8G? ; . : 
gg EGU? + (G8) — PT 12.18) 
and the total cross-section is 
8G? v\2 e\2 1 e\2 
a(vye) = — mE (Gi) Gi)? +H]. (12.19) 


It should be noted that the (1 — y)? term in (12.18) and its corresponding 1/3 
factor in (12.19) are associated with the term having the opposite helicity to 
that of the incoming neutrino. Thus the cross-section for the anti-neutrino 
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process V, + e > V, + e can be inferred to be 


2 


8GF 2ri/„e e 
Ae) = ECE mE gi) CHO)? + (R (12.20) 


since V, has opposite helicity to v,. From eqns (12.19) and (12.20) we see that 
measurements of the total cross-sections o(v,e) and o(v,e) map out ellipses in 
the (gi, gg) plane (see Fig. 12.3) and these ellipses intersect at four allowed 
regions. The four solutions reflect the two sets of sign ambiguities: the cross- 
section results are insensitive to the substitutions (i) g œ —g,, Gr —Jr; 
and (11) gL œ g1, gr — Gr. The latter substitution is simply a vector—axial- 
vector ambiguity. Thus we need two more independent measurements to 
resolve them. 


Ve + € > V. + e and V, + e > V +e 


In these two processes both the charged and neutral currents contribute (see 


eqn (11.97) and Figs. 11.4 and 12.2). The cross-sections are computed to 
be 


8G? i 
o(vee) = —— MELASE + gi)” + 3(9%)"] (12.21) 
= 8GF v\2r1 e\2 ea2 
o(v.e) = Ea mE (gi) BA +91) + (gr) J. (12.22) 
$ e 
Z 
e e e e 
Fic. 12.2. 


Fic. 12.3. ve scattering and e*e~ annihilation experimental results for neutral-current 

couplings of the electron. gy = (gi + gR) ) and ga = (gi — gp). The only allowed region 

common to all four types of pure leptonic processes corresponds to that of the standard model 
with sin? 0, œ 0.22 (Barber et al. 1981). 
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The cross-section in (12.22) has been measured by using reactor neutrinos. 
This resolves the sign ambiguity (i) mentioned above as the sign of 
interference of the CC and NC amplitudes is now known through the 
linear g, term in (12.22). This still leaves us with two possible solutions in Fig. 
12.3 due to the vector and axial-vector ambiguity (11). 


The neutral-current effect in e*e7 > p*p7 


The weak Z vector boson can contribute to Bhabha scattering e*e~ — e*e7 
as well as to e*e7 — "py, ttt. We shall concentrate on the simpler case 
ofe*e” > p*p (or tt”) reactions with the diagrams shown in Fig. 12.4. 


Fic. 12.4. 


The cross-section can be worked out to be 


do na? 
— —[A 2 B 0 , 
eoD Os [A(1 + cos“ 0) + B cos 0] (12.23) 
with 
A=1+4+2Reyxgigh +I ER + AIN + (94)7] 
(12.24) 
B=4Reygagh + 81xl’9V GN ISIS (12.25) 
where 
G-M2 S 
a ee? (12.26) 


K= 22ra (s — M2 +iM,T) 


gv = (gu + gr) and ga = (gı — gr). 0 is the angle between the outgoing u` 
relative to the incoming e~ and T is the width of Z boson. The leading term in 
A of course is the pure photon contribution; the Re y terms correspond to the 
interference of the neutral current with the electromagnetic current; the terms 
proportional to |x|? are pure Z-boson contributions. Because of the presence 
of the cos 0 term in (12.23) we have a forward-backward (FB) asymmetry 


få d cos 6(da/d cos 0) — f°, d cos O(da/dcos 0) 3B 
f+, d cos @(da/d cos 0) 8A 
(12.27) 


At low energies s « M?, we have 


(12.28) 
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8sGp 
/2 e? 


where we have used universality to set gy a = gẹ a. We can see two weak 
effects. 


Bx — g2 (12.29) 


(1) Modification of the total cross-section o(e*e 
QED. At low energies this is proportional to g%. 

(2) The forward—backward asymmetry in the angular distribution, which 
at low energies measures gi. Already at the highest PETRA energy, these 
weak effects can be measured to map out an allowed region in the (gy, ga) 
plane of Fig. 12.3. This singles out one of the remaining two solutions—that 
of the standard theory with sin? 0 x 0.22. 


—> u*p) from that of 


Neutrino—hadron neutral-current processes 


While the pure leptonic processes considered above have the advantage of 
being free of strong interaction complications, their usefulness is diminished 
somewhat because the experimental data in this area generally have poor 
statistics. This is in contrast to the neutrino—nucleon scatterings where we 
have abundant and precise data, especially those of inclusive v scatterings 
from isoscalar targets. To interpret the experimental results in the ap- 
propriate kinematic region one invokes the quark—parton model. In this way 
one can extract the first-generation quark weak couplings gi in an 
analogous manner to the v-lepton scatterings considered above. For instance 
for the deep inelastic scattering v + N > v + X from an isoscalar target 
N = 4(n + p) (similar to eqn (12.18)), we have for gj = 4 


NC 2 
o — EATE D + (gI + D? + DP — 99} 


(12.31) 


where 
1 


Q = EDG + d(x)] dx (12.32) 


0 


with u(x), d(x) being the quark distribution functions depending upon the 
Bjorken scaling variable x, and we have set the antiquark distributions 
u(x) = d(x) = 0 (see eqns (7.72) and (7.78)). We note that the coefficient in 
(12.31) is just the parton cross-section value for the charged-current process 
v+N> l +X, 


do““(VN) _ GEME,Q 


12.33 
ay (12.33) 


When the ratio R, = o*“(vN)/o““(vN) is taken, we are left with the weak 
couplings that we are seeking. Exactly similar results can be obtained for 
anti-neutrino scatterings, but keep in mind that here the (1 — y)* factor is 
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associated with the g2s and do““/dy. Thus, o“°(VN) =40°°(vN). One finds 


oN (vN) + oS (VN) 


O WN) +O WN) a2 d\2 u\2 d\2 
(VN) + oC ON) (gi) + (91) + (gR) + (GR) (12.34) 


ao (vN) — o“S(WN 

cen ak any 7 (9)? + (at? — (oR)? D 1239) 
Just as in the cases of v,e and V,e scatterings, we need to supplement these 
vN, VN measurements with data from proton and neutron targets, or from 
semi-inclusive pion production off isoscalar targets, to resolve the sign and 
isospin ambiguities. We shall not give the details of this here. Suffice it to say 
that the results are again in accord with the standard theory. Here we will 
only mention a plot of the function corresponding to the standard model 
value for (12.35) of 


oN (VN) — o“S(WN) 1 


SON oN 720 — 2 sin? Oy). (12.36) 


This is the Paschos- Wolfenstein relation (1973). Thus measurements of R, 
and R, immediately yield a value of sin? 0w. The most recent and accurate 
data are from CDHS (Geweniger et al. 1979) and CHARM (Jonker et al. 
1981) collaborations. 

It should be mentioned that in the actual phenomenological analysis much 
more detailed calculations of sea-quark contributions, QCD corrections, etc. 
are taken into account. Further discussion on the phenomenological values 
of the Weinberg angle and the p-parameter (12.2) will be presented in §12.3 
when we discuss the W and Z intermediate vector bosons. 


The electron—deuteron asymmetry measurement; a historical note 


Although we have presented the electron weak coupling results gi pz as 
measured in the leptonic ve and ete” reactions, historically the first 
elucidation of the electron weak couplings came from the semileptonic e-D 
scatterings in a beautiful experiment performed at SLAC by a SLAC~—Yale 
collaboration (Prescott et al. 1978). At the time there was considerable 
confusion over the structure of electron weak couplings as a number of 
searches failed to detect atomic parity violation effects at the level predicted 
by the standard theory. There were also claims of experimental effects 
(an ‘anomalous’ y-distribution in vN scattering) that might be interpreted as 
indicating the presence of non-trivial V +A charged currents, etc. 
Consequently a number of variants to the Weinberg—Salam model were 
proposed to account for these observations. The SLAC experiment measured 
parity violation asymmetry 


_ o(A = 1/2) — ao(à = —1/2) 
-= alà = 1/2) + ao(à = —1/2) 


where o(A = +1/2) is the double differential cross-section d?a/dQ dE, for the 
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scattering of right- and left-handed electrons on deuterons. From the 
observed asymmetry parameter and the prior knowledge of gi: % one can then 
deduce gj. r. The SLAC experiment was decisive in showing that the 
standard model was practically the only viable theory. 


12.2 Weak mixing angles, the GIM mechanism, and CP violation 


In this section we study the phenomenological consequences of the multi- 
family structure of the standard theory as presented in §11.3. In particular, 
we will concentrate on the strangeness-changing neutral current and the CP 
violation in the K°-K° system as examples illustrating the implications of the 
GIM mechanism. 

We have shown in §11.3 that, because the Yukawa couplings generally 
involve fermions belonging to different particle-families (or generations), 
flavour-space fermion mass matrices are not diagonal. This means that mass 
eigenstates are different from weak eigenstates which have definite gauge 
transformation properties. For the simple case of two generations of 
fermions, this produces the Cabibbo mixing of the quarks in the charged 
current, and the quark weak eigenstates are 


u c 
Gl (o) UR; Cr; dr, SR 
d’ j d T cos @. sin 0. \/d 
JJ FAs), \—sin@, cosb \s/r 


i.e. the weak eigenstates d’, s’ are rotations of mass eigenstates d, s. Note that 
the Cabibbo angle 6, is the difference of the rotation angles between the 
(uL, C1) and (d,, s1) sectors. This is why there is no mixing angle in the right- 
handed fermion sectors because (Up, Cp)s do not couple to (dp, Sp)s. The 
neutral current which is proportional to the operator (Q sin? Ow — T3) (see 
eqn (11.94)), has the important property that it is flavour-diagonal (or 
flavour-conserving). This follows from the fact that all fermions with the 
same charge and same helicity have the same transformation properties 
under the gauge group SU(2) x U(1) (Glashow and Weinberg 1977; Paschos 
1977), so that the rotation matrices such as the above 2, commute with the 
neutral-current operator (Q sin? Ow — Tı). For example, in the (d,, sı) 
sector, the (1 — y,) part of the neutral current is 


y E- l : l d' 
Ja(d, s) = (d',5 x| -3 sin? Ow + T) 


a I. l j/d 
= (d, | 3 sin? Ow = 10) 


which is flavour-conserving. Actually, this originally motivated Glashow, 
Iliopoulos, and Maiani (1970) to introduce the c quark coupled to s; so that 
s, has the same SU(2) x U(1) quantum number as d; to cancel the 


and 


372 Standard electroweak theory II: phenomenological implications 12.2 


strangeness-changing neutral current (GIM mechanism). Otherwise, it 
would give rise to the order Gp, AS # 0 neutral-current processes (e.g. 
K, > utu, K*—2*vv) which is phenomenologically unacceptable. 
It should be emphasized that the GIM mechanism achieves this suppression 
without any artificial adjustment of the parameters in the theory. In fact, 
as we have mentioned in §11.3, the GIM mechanism means much more 
than this tree-level cancellation; it also provides the additional suppression 
for the AS #0 neutral currents that are induced by higher-order loop 
diagrams. The need for this additional suppression is that without it these 
induced amplitudes would be of order Gpæ while the experimental data 
on these processes are typically of order Gm? (with m being a few GeV). 
We shall illustrate this suppression mechanism with a calculation of the 
K-K, mass difference: Am = 0.35 x 10714 GeV. 

We mentioned in §11.3 that, for the case of three generations of fermions, 
one has the additional feature that there can be a CP-violating phase in the 
mixing matrix. We shall discuss the implication of this in the calculation of 
the CP-violating state-mixing parameter € in the neutral kaon system. 


Cabibbo—Kobayashi-Maskawa (CKM) mixing matrix 


The basic charged-current (CC) interaction of eqn (11.86) with the mixing 
introduced by the need to diagonalize the quark mass matrices (eqn (11.124)) 
leads to the following CC couplings. In the simple two-family case, we have 
the familiar Cabibbo rotation, augmented by the GIM charmed quark, 


I . af cos@ sind@,\/d n 
Sea W ç: : 
Lc wD (u, C)Ly e D deus ws), u + he (12.37) 


Since in this case the mass matrices can be taken to be real, the unitary 
transformations are just ordinary rotations. Also the amplitude for any 
strangeness-changing processes d +> s must be proportional to (m. — m,) 
sin ĝ,, as in the m, = m, limit one can always choose one linear com- 
bination of c and u so that it does not couple to both s and d. A similar 
situation holds if we have d and s degeneracy, etc. In the three-family six- 
quark case, the mixing matrices are not just ordinary orthogonal matrices; 
we have 


d 
Loo =A (0,2 U| s | wt the. (12.38) 
ai 
b /L 
where the unitary matrix 

Uia om Op 
U = Uca Ue ORS (12.39) 

Ua Us Uv 


can have one complex phase and can be parametrized in a form first 
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introduced by Kobayashi and Maskawa (1973) 


Cy S1C3 S153 
U =f —s,c, C1C2C3 — S253 €° C1C3853 + S2C} e” (12.40) 


—S1$2 C823 + C283 € C,S,8,; — CC; e” 


where we have used the abbreviations c; = cos 0; and s; = sin 0;. By suitable 
choices of the signs of the quark fields, we can restrict the angles to the ranges 
0 < 0; < n/2 and -n << T. 

In this KM parametrization 6, corresponds closely to the Cabibbo angle. 
It is also clear that 6; must be small because of the observed validity of 
approximate Cabibbo universality. To be more precise we recall that, from 
the muon lifetime and from the super allowed 0* — 0* nuclear B-decays, one 
can extract two values of the Fermi constant, which we indicate by G À and 
GF respectively (R stands for renormalized parameters). Using (12.39), 
one has the theoretical prediction 


[Gi/GE] = Un [l + (A, — âr)] (12.41) 


where A, and Ap are, respectively, the radiative corrections to Gi and G in 
which their finite part are expected to be different for u- and for B-decay. This 
difference contributes a non-negligible correction ~2 per cent. One finds 
(Sirlin 1978, 1980). 


IUa] = 0.9737 + 0.0025. (12.42) 
Also an overall fit to hyperon decays yields (Shrock and Wang 1978) 
|U,,| = 0.219 + 0.003. (12.43) 
Combining the constraints (12.42) and (12.43), one finds that 


|Uual? + |Uusl? = 0.996 + 0.004. (12.44) 


As the central value is less than one, this indicates a ‘leakage’ of u-quark 
coupling to the b-quark by 


IUl = 0.06 + 0.06 (12.45) 
which can be translated into the KM angle of 
ls3| < 0.28. (12.46) 


Thus 0, may be as small as (if not smaller than) the Cabibbo angle. At this 
stage of our knowledge, in order to obtain constraints on other KM 
parameters one needs to proceed by rather indirect routes, for example, by 
calculations of K°< K° parameters (see calculation below). All indications 
are consistent with the qualitative feature that the diagonal elements of the 
CKM matrix (12.39) are the largest, and the magnitude of the matrix 
elements decreases as the element moves away from the diagonal 


(Uaua ~ U.s ~ Us) >> (Ua ~ Uss ~ Uus ~ Uc») >> (Uu ~ Ua). 
(12.47) 
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Thus, all KM angles 0; are small. This implies the dominant decay chain 
t>b->c>s. 


Accordingly it is meaningful to talk about fermion families even in terms of 
the mass eigenstates (u, d), (c, s), (t, b), etc. as intrafamily couplings are the 
strongest. 

In the KM parametrization (12.40) the CP violation phase ô appears only 
in the heavy quark sector. This is a convention. Clearly one can move ô to 
other sectors by redefining the phases of the quark fields. This means that 
one must involve more than one matrix element in the mixing matrix U to get 
CP violation. Physically this corresponds to the fact that CP violation comes 
from interference between amplitudes with different CP eigenvalues. 


Phenomenology of K°-K° mixing: some basic parameters 


We will give a brief description of the K°-K° system in order to set up the 
framework to study the CP violations in the standard model. (For details see 
Marshak, Riazuddin, and Ryan 1969; Kleinknecht 1976; Wolfenstein 1979.) 

In a beautiful application of quantum mechanics Gell-Mann and Pais 
(1955) first discussed the decay of neutral K-mesons. The interesting feature 
is that, because the strong interaction conserves strangeness while the weak 
interaction does not, the neutral kaon eigenstates with respect to these 
interactions are different from each other. In particular, the strong- 
interaction eigenstates K° and K° can mix through weak transitions such as 
K°22x2K°. In this K°-K° system, the K° state is defined as the CP 
conjugate of K°, 


IK°) = CPIK®). (12.48) 


We will describe the weak-interaction-induced transition in the K°-K° 
system by the S-matrix elements, 


a lT exp( =i [ro ar) (12.49) 


where a, «' = K° or K° and Hi(t)=e'"H, ee” where H, is the weak 
Hamiltonian. To second order in H,,, we can write the transition matrix (T- 
matrix) element T,,, as 


Sa'a — Sa! = — 2M T yy! (Ea) (Ey =- Ex) (12.50) 
where 


Taw (Ea) = (0'|Ay|a> — 3 | dt<a'|T(H x(t) H (0))a> 


ly | <a'|A,|A>D< ALA |e 
= (a'|H. fi De ne AA 
EE | E, — E, + ie 


EIEH] 
E,— E, +1 
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Since K° and K° are eigenstates of the strong interaction, we have for «, a’ at 
rest 


<a Hala) = Ôn Ma (12:51) 


Thus combining eqns (12.50) and (12.51) we can represent the matrix 
element of the ‘effective Hamiltonian’ in the K°-K° system as 


o'|H le) = ag = Mx Oga + Tyq(Mx) (12.52) 
For convenience, we will write the matrix element in eqn (12.52) as 
H H 
H= í = 2) (12.53) 
H3, H32 


where the subscript 1 refers to the K° state and the subscript 2 refers to the 
K° state. Using the formula 


1 1 
ESSE e = 12.54 
y—a+ie y—a a ( ) 


we can decompose the matrix element of Hr into real and imaginary parts, 


B 
H=M—-iz (12.55) 
Or 
A, = M B 
aa T aa’ 2 aa 
where 


l 
M aw = Mpk Daa! T <a | Hyla z PY oo a Cay |A>< ALA | 
z Mg — E, 


(12.56) 
Law = 27 X <a'|Ay|A><ALH la> ÒE, — mx). (12.57) 
À 
Note that I, M are hermitian matrices 
t=T, M'=M (12.58) 


because of the hermiticity of H, and H,,, while the ‘effective Hamiltonian’ 
H.x is not hermitian. From CPT invariance, one can show that 


Hi, = Hp» 
or 
EF = I3 and Mı — M22. (12.59) 


It is straightforward to diagonalize the Hamiltonian given in eqn (12.53) and 
the resulting eigenstates and eigenvalues are 


H|K, s> = ÀL, sK, s% (12.60) 
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where 
As = Hy, + (Hy2H2,)? (12.61) 
AL = Ay, = (H,,H>,)? 
l a 
IKs) = JA + le?) [A + e)[Ko> + (1 — €)[Ko>] (12.62) 
l = 
IKi> = Ja + la) LA + €)[Ko> — (1 — €)|Ko>] 
with 
_ VA. — J Aa 


a JHy2 + JA Cee) 


The real and imaginary parts of A, s are, respectively the masses and decay 
rates of K, s 


i 


ÀL, s =M, s ~ 5 PLS: (12.64) 
Note that CP invariance will imply that 
M,> = M21 and l2 = EF (12.65) 
or 
H; = Ay, 


which will give e = 0 from eqn (12.63). Together with the hermiticity con- 
dition in eqn (12.58) this will imply that M;; and T;; are all real if CP is a 
good symmetry. Since the observed CP violations in the neutral kaon system 
is small, we will make the approximations that Im M,, « Re M,,, ImT,, 
« ReI’,,, and e is small. Then we can write 


_ H,, — Hy, 
4./Hy2Hy, + (VB — JA)’ 
= Hy, — Ay, 
4./H 12H), 
~~ (2 Im M,, — i Im T2) 
2i(A, — As) 


E 


(12.66) 


The parameter € is a measure of the CP violation in the physical states of 
the K°-K° system. From eqn (12.57) one can show, by examining the 
contributions coming from all possible intermediate states 4 = 2m, 37, 
nmev..., that |ImI',,| « |Im M,,|. Then the difference in masses and decay 
rates can be approximated by 


—Ay = yi — Ys = 2012 (12.67) 
Am = m, — ms = 2|M,,| > 2 Re M> (12.68) 
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as Im M,, is small. The € parameter can then be written as 


—Im M32 


e = ———__— 
4Ay — iAm 


(12.69) 


It is one of the magic properties of the K°-K° system that with m, being so 
close to the 3x threshold, and (reflecting the smallness of e) with the 
dominant decay modes Kg and K, being the CP-even 22 and -odd 3z, 
respectively, K; and Kg have very different lifetimes 


Ys = 500%; œ Ay. (12.70) 


This allows for a clear separation of these two eigenmodes in the laboratory 
and made possible the eventual discovery by Cronin, Fitch and their 
collaborators (Christenson et al. 1964) of the very small decay mode 
K, — 27. Detailed study of the interference between the K; and Ks waves 
allows us to infer the extremely small mass difference Am = m, — ms 


Am/m, = 0.71 x 107+4 (12.71) 
which corresponds to 
Am ~ $s. (12.72) 
We can then deduce the phase of ¢ (confirmed experimentally) 
arg € œ tan” '(2Am/ys) ~ 45° (12.73) 


and write its magnitude in a simple form 


1 Im Mi 
Jel zm (= 7): (12.74) 


It is also clear from (12.72) that Am is a G2m? effect and we shall show how 
such a result can be understood with the GIM cancellation mechanism. 

The basic CP violation parameters that have been measured are the 
amplitude ratios 


N+- = (nn |H KL) a |Hy|Ks> 
Noo = LTT |, |K,>/(1°n|HylKs> (12.75a) 
and the ‘charge asymmetry’ 


_ T'(K, => n+l y) m T(K. > n 1*v) 


one r(K, > ntl-v)+TK, > n-I*v) 


(12.75b) 


By isospin decomposition of the final 27 state into J = 0 and J = 2 parts, one 
gets 


N+- = (€ + £2)/(1 + 274) 
Noo = (Eo — 2€2)/(1 — 24a) (12.76) 
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where 
€o = CU = OHK) = O|H,|Ks>, 
2 = 5 (I = AAK OKI = OF IK), 
and 
w = lI = 2|H,|Ks)/<I = OH |Ks). (12.77) 


Because of the validity of the AJ = 1/2 rule for CP-conserving decays œw « 1 
and it can be neglected. Furthermore, we can parametrize the K° > 2n 
amplitudes as 


CI = n|H,|K°> = A, e” (12.78) 


where 0, 18 the mz phase shift in the J = n channel coming from the final state 
interactions. A commonly adopted phase convention (Wu and Yang 1964) is 
to choose A, to be real, 


Im A, = 0. (12.79) 
In this case, it is easy-to see from eqns (12.62), (12.77), and (12.78) that 


Eo = £ 
by = eons mA =p", (12.80) 
Therefore, 
Npa EE, 
Noo = E — 26 
ô = Ree. (12.81) 
Experimentally we have 
le] = 2 x 107? (12.82) 
le’/e| < 1/50. (12.83) 


All these results are consistent with an early proposed theory of Wolfenstein 
(1964) which relegates all CP violations to a AS = 2 superweak interaction. 
We shall demonstrate below that the standard electroweak theory with KM 
mixings can yield numbers mimicking the superweak theory results. 


AS = 2 effective Lagrangian for free quarks 


The strategy, originally due to Gaillard and Lee (1974), is to construct first an 
effective AS = 2 Lagrangian from the free quark model and then to sandwich 
this Zir between the K° and K° states to obtain Am through (12.68) and Jel 
through (12.74). 

One first computes the box diagrams of Fig. 12.5 with intermediate quarks 
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a| 


J S 
(b) 


Fic. 12.5. Feynman diagrams for the AS = 2 transition amplitude sd —> sd. The ‘annihilation’ 
term of (a) is equal to the ‘scattering’ term of (b). i,j = u, c, t. 


i,j = u, cC, t in the approximation of taking all external momenta to be zero 


(as they are small compared to Mw and heavy quark masses). In the 
Feynman-’t Hooft gauge, we have 


teak, g \* dtk (_ -i _ y 
iT(8d > sd) = (5) EE; | One (ose 


~- ,ykim, , _ yrRim, 
(aar Bae Ts) (dares) 


(12.84) 


where 
C= UUS (12.85) 


where the Us are elements of the CKM mixing matrix (12.39). First 
concentrate on the momentum integration given by 


Lj) = dtk kkp 
POE E= MIE = mE? = m3) 
—in? 
= 4M?, A (Xi, X;)Jag (12.86) 
where 
I(x.) — J(x. 
Aba, x) <2 AID. 
Xj [os Xj 
l x? In x. 
Ie) =] L 12.87 
ee ea (12.87) 
and 
m? 
Xi = M2, 
Using the identity 
PEP = gy + gy — gy — iet PYsYg, (12.88) 


we can evaluate the Dirac matrices 
Lyd — Ys)/21. -Laal — Y5)/2] 
= Af yy" — 75)/21. -- Lya — Y5)/2]. (12.89) 
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Substituting the results of (12.86) and (12.89) into (12.84) 


Gp a 


Kedsedee erat 


(dy y"s,)(d.y,S1) 2 Cis; A(x; > x;). 
n (12.90) 


After taking into account the (2!)(2!) Wick contractions, this is an amplitude 
that can be obtained directly from an effective Lagrangian 


E E R (a) 1» (12.91) 
J2 16r \ My sin Ow 
where 
Oy = [dy"(1 — ys)s][dy,(1 — y5)s] (12.91la) 
and 
A= My > éA, x;). (12.91b) 
i,j 


QCD gluonic radiative corrections can, in principle, be taken into account. 
However, given all the other uncertainties, such corrections are not expected 
to change our conclusions materially and we shall ignore them. 


The K,—Kg mass difference and GIM cancellation mechanism 


Given the effective Lagrangian (12.91), we can calculate Am through eqn 
(12.68) 


Am = —2 Re [<K|— #48-|K)]. (12.92) 


There are two aspects to this calculation. The first is the estimation of the 
matrix element <K|0,,|K>; the second is the evaluation of the c-number 
(12.91b) corresponding to the sum of products of mixing angles and quark 
masses. 

To get an order of magnitude of the size of (K|@,,|K>, we make the 
‘vacuum saturation’ approximation 


5 = = 8 A $ z 
CKi[dy"(1 — ys)sJLdy,( — ys)s]|K> = 3 <Kidy"yssl0><Oldy,7ss|K 


_ 8fkmk 
3 Wm, 


(12.93) 


where fg ~ 1.23 f, is the kaon decay constant; the factor (2m,)~ +} arises 
from the normalization of the state. The factor 8/3 corresponds to the four 
ways of Wick contraction times a colour factor 2/3. The hope in making such 
an approximation is that the simple vacuum intermediate state will give us a 
representative value of the four-fermion matrix element 0,,. A somewhat 
more realistic calculation of this matrix element in the MIT bag model yields 
a'value about half the size (i.e. the same order of magnitude) as resulting 
from this vacuum insertion calculation (Shrock and Treiman 1979). Even so, 
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there are still uncertainties as to whether these quark diagrams really 
dominate the low-energy parameters such as the K; — Kg mass difference 
(Wolfenstein 1979). Ignoring these complications, we combine eqns (12.91) 
and (12.93) to get 


Gp a fxm 
Am = J2 ba ae re Rel X €i¢;A(Xi, Ji (12.94) 


At first sight it appears that, as the right-hand side is O(aG;), it will yield 
much too large a value for Am. This is where the GIM cancellation 
mechanism comes in. Since x; « 1 we can expand A(x;, x;) of (12.87) in x; 
and x;. The leading constant terms cancel in the summation because of the 
unitarity condition 


Yo =) UUs = 0. (12.95) 


In the remainder, the dominant terms are proportional to x; 


Xi 


X. 
Sebati S a fe (12.96) 
i,j i ižj Xi Xj Xj 
Thus we see that the factor x; converts Gra to Gm? 
Graxč, ~ G2mré,. (12.97) 


Taking m, = 0, we may write 


m — m, 2 Gr m, 


eel A 
m — 3 J2” “4x (37 GeV 


2 
) sin? 0, cos? 0, X (12.98) 
with 


X = (sin? 0, cos? 0.)~ Rel U..US) + (Us U) (M/m) 


2 2 
FUU% U E (=) (12.99) 
Mm — M 
where we have used eqn (11.100). In the four-quark model, where we have a 
simple Cabibbo rotation (12.37) with (U.,U%*,)? = sin? 0, cos? 0., the factor 
X = 1 in (12.99). This is how Gaillard and Lee (1974) before the discovery of 
J/w first estimated that m, ~ 1.5 GeV. Even though m, is expected to be very 
large (220 GeV) still it is multiplied by a small mixing angle factor U,, (see 
eqns (12.45) and (12.47)) and we do not expect X to deviate significantly from 
unity. Thus the result in (12.98) must be regarded as a remarkable triumph of 
the GIM mechanism as embodied in the standard electroweak theory. 
Before concluding this subsection on the GIM cancellation mechanism, 
two remarks are in order. 


Remark (1). For a heuristic understanding of how the GIM cancellation 
comes about one may use the concept of mass insertion to view these GIM 
loop calculations (see for example Cheng and Li 1977). Mass insertion can be 
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thought as either the simple expansion of the fermion propagator 
ee Sts (12.100) 


or as treating the fermion mass matrix (W,,m,;W jp + h.c.) as a perturbation 
on the symmetric theory (where all fermions are massless). Such a mass 
insertion in an internal line flips the helicity of that particle and may also 
change the identity of the weak eigenstate if one of the off-diagonal elements 
in m,; is used. For example the basic mechanism for s +> d transition is shown 
in Fig. 12.6. The couplings in these diagrams, by inspection, must be 
proportional to (see further comment in remark (2) below) m,m, = 
x, U4,;U%m? which is just the leading term in a GIM cancellation. (Our 
example of the Am calculation is slightly complicated by the fact that it is a 
AS = 2 transition.) 


>— 
di UL IR CL SL 


Fic. 12.6. The basic mechanism for the s d transition. All particles are weak-interaction 
eigenstates with į = u, c, t. Hence, the mass matrix has nondiagonal elements. 


Remark (2). Whether the GIM mechanism produces a power suppression 
factor X, Ua U*(m?/M4) which vanishes in the limit where all the m;s are 
equal depends on the convergence properties of the particular loop 
integration under consideration. There are cases where the GIM cancellation 
is much milder than this power suppression. It may take on the form 
È, Ua Už ln(m?/M&) which also vanishes in the equal-mass limit. An 
example of such a logarithmic GIM cancellation is the sd transition 
charge radius, which is physically relevant for example in the process 
K > ne*e” (Gaillard and Lee 1974). In the °t Hooft-Feynman gauge the 
leading contribution comes from the diagram in Fig. 12.7(a). To lowest order 


in Mẹ’ the W-propagator may be approximated by 
z ; (12.101) 
k? — M} M} 


One then makes a Fierz rearrangement (see Appendix A) in the resulting 
V-A four-fermion interaction 


SLY" git (qi Y dı) = (Spy"d (Git? aqil). (12.102) 


Thus we can calculate the transition vertex directly from the diagram in Fig. 
12.7(b) 


F, p"; 4) = ( i; e)( 1S ara Del) (12103) 
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s(p) d(p’) 


y(q) 


(a) (b) 
FıG. 12.7. The leading contribution to the sd transition charge radius. (b) is the Fierz- 
rearranged V—A four-fermion approximation to (a). 


where 


d*k i i 
I) = (Fr oe a 


a 12.1 
yk =m, "yk 4) —m, eae 


I,, is just the familiar vacuum polarization tensor with the result that, for 
2 2 
Mm; > q’, 


l l 
Lali) = 5 (Mds — Gay PUn A — Inn? /q?y]- (12.105) 


Again the (divergent) constant terms cancel because 2, é; = 0 leaving behind 
a mild GIM suppression factor ©, é; In(m?/q7’). 


The kaon CP-violation state-mixing parameter and the CKM angles 
The CP-violation parameter € of eqn (12.74) is given by 


_ 1ImcK| — 27K) _ 1 (Im() 
e S= S RecK)— PSARY ~ 2 (Feu) Cee) 


where the four-fermion matrix element <K|@,,|K> is divided out and 4 is 
given in (12.91b). Again in the limit m, = 0, we can use (12.95) to get 


Im(A) = 2 |e’ + Exépmy 


mém? m? 
+ (Cadi + CuSer) cee In (=) (12.107) 
mMm T Mg mg 


Re(2) = E E +(E%, — Bm 


2 22 x 
A EE E in) (12.108) 


m? — mM, 


where we have used the notation č; = Im č; and č;ę = Re €;. With the KM 
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parametrization given in (12.40), we note that 
Cu = — Če = C284$2$3 Sin Ò. (12.109) 


This allows us to factor out €, in (12.107). Thus Im(A) and hence the CP 
parameter e itself, is proportional to this combination of KM angles 


E€ OC $8583 SIN Ô. (12.110) 


Then the CP-violation parameter is suppressed by all the KM angles. Since 
we have some grounds to expect all the 0;s to be small, this elegant theory of 
CP violation naturally gives us a small e. 

For definiteness, we shall write out eqns (12.106)—(12.108) for the case of 
small KM angles. Except for the overall č, we shall drop all such terms and 
approximate 


Čer S —S84C1C3C3 
Čir &% —841C183C3. 
The expression for čr is only valid for the special case 
S2 > S3. (12.111) 


Such a situation is consistent with, but not demanded by, present experi- 
mental constraints. We have 


Im(A) = Deal čar = Epm? 


22 2 
+ (Čir — Čer) in (=) (12.112a) 


2 2 
M — M: c 


2 222 2 
Re(A) ~ | eam + Em? + et In = (12.112b) 


Me 


Cc 


Substituting (12.112) into (12.106), we have (Ellis et al. 1976a,b, 1977) 


s3(1 + n Inn) — c3n(1 + In 2} 


12.113 
s4 + c3y — 2s3c3ny ln n ( ) 


€ œ C,S,S, sin al 
where 4 = m/m? and some higher-power terms in y have been dropped. 
We conclude this discussion of CP violation with three brief comments. 


(1) Calculating ©’. The calculation of e’, the CP-violation parameter in 
decays (as opposed to e in the state), is less certain as it depends on our 
theoretical understanding of the nonleptonic weak decays. In most ap- 
proaches ¢’ is found to be very small (much too small to be detectable). 


Fic. 12.8. Penguin diagram for sd > dd with i = u, c, t. 


12.2 Weak mixing angles, the GIM mechanism, and CP violation 385 


However if the K — 2n decay is dominated by the one-gluon exchange 
‘penguin diagram’ (Gilman and Wise 1979) of Fig. 12.8, the calculated ratio 
|e’/e| is fairly insensitive to the values of the mixing angles and is predicted to 
be 0.01 to 0.03. Such a value may be accessible to verification by the next 
generation of experiments. 


(2) Neutron electric dipole moment. This is another important area where 
the standard theory with CP-violating complex CC couplings can simulate 
the superweak model. Both predict an electric dipole moment for neutron 
d,/e much smaller than the already extraordinarily stringent limit of less than 
10~7©cm (Ramsey 1978; Altarev et al. 1981). The basic reason for the 
smallness of this quantity in the standard electroweak theory is again related 
to GIM cancellation: the one-loop contribution such as Fig. 12.9(a) vanishes 
because it is ‘self-conjugate’ and there is no CP phase, 2, U,;U* = 1. In such 
diagrams whatever the phase at one vertex it will be cancelled by the opposite 
phase coming from the other vertex. At the two-loop level we have the 
possibility of ‘non-self-conjugate’ diagrams, Fig. 12.9(b). This results in an 
electromagnetic vertex with 4-momentum transfer k as 


ed,(k*)ito, ,k’y uA". (12.114) 
The form factor d,, being the electric dipole moment in the static limit, is 
suppressed by the usual GIM factor of Am?/M¥ = X; U,,U*,m?/My. 
However Shabalin (1978, 1980) has shown that even this vanishes in the 


k — 0 limit as signified by the cancellation between the two sets of diagrams 
shown in Fig. 12.9(c). The CP odd part is proportional to 


l l l l 
ER E EE E A 
o p-m p+k-m n| 


u i u u q j q’ u 
(a) 


l 
(b) 


W(r—p) 


u(r) q(p) q'(p+k) u(r+k) 


y (k) 
(c) 
Fic. 12.9. Diagrams for induced u-quark dipole moment: the photon line is to be attached to all 
the charged lines in (a) one-loop and (b) two-loop diagrams. In (c) the black dots represent the 
quark e.m. transition vertices. Being at least one-loop, the graphs in (c) are actually two-loop 
diagrams, and they cancel in the k > 0 limit. 
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where I’,,,s are the regularized quark electromagnetic transition moments, 
and it can be shown that up to one-loop 


lig = Lip 


In the static limit, k > 0, we can drop the momentum k in the quark 
propagators and clearly the above contribution vanishes. Therefore, one 
must include higher order radiative corrections, the leading term being that 
due to QCD gluons. Thus the neutron dipole moment is expected to be of the 


order 
4 4 
A g m, 
< o ee A LC A 12.115 
= ma( OTME ta ( ) 


The external factor of m, a reflects the helicity-flip nature of the dipole- 
moment operator. The numerical value generally quoted is 


Id Je] < 10733 cm. (12.116) 


dr 
e 


When one considers graphs with exchanges among different valence quarks 
in the neutrons, generally the same bound is obtained. However a possible 
large contribution may come from those involving Penguin diagrams; the 
GIM suppression being only logarithmic, they lead to a (d,/e) value as large 
as 10° *° cm (see Gavela et al. 1982 and references cited therein). 


(3) Hard vs. soft CP violations. In the standard theory there is only one 
Higgs doublet so that the vacuum expectation value <®), has to be real 
because any possible phase had no physical significance and can be rotated 
away. The sources of the complex mass matrices are the Yukawa couplings 
themselves. This is usually called hard CP violation in the sense that it is an 
effect due to dimension-four operators. This is in contrast to another class of 
CP-violation theories where the violation arises from spontaneous symmetry 
breaking—hence a soft variety (T. D. Lee 1974). To get spontaneous CP 
violation, one has to extend the Higgs structure (e.g. two complex doublets) 
to get the complex vacuum expectation value. Physically the difference 
between hard and soft CP violation is that soft CP violation effects disappear 
at energies higher than the energy scale of the symmetry breaking, while hard 
CP violations will persist. One should also point out with more than one 
Higgs doublet the Yukawa coupling is naturally nondiagonal in the flavour 
space unless some discrete symmetries are imposed. These couplings gener- 
ally tend to induce too large a rate for the strangeness-changing and muon- 
number nonconserving neutral-current effects (also see remarks in $11.3). 


12.3 The W and Z intermediate vector bosons 


The most basic and distinctive feature of a gauge theory with spontaneous 
symmetry breaking is the existence of a set of massive gauge bosons. In the 
standard model, there are three such intermediate vector bosons: W+, W7, 
and Z. Here we discuss in turn their masses, decays, and possible production 
mechanisms. 
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Masses 


The W and Z masses can be expressed in terms of the Weinberg angle Oy as in 
eqns (11.100) and (11.82). 


ie e yea 37.3 GeV 
My =~ = 12.117 
"2 E z) sin w sin Ow l ) 
Se A 74.6 GeV 
TE Oe Sen E aAA (12.118) 
J2 Gf sin 20y sin 20w 


Experimental knowledge of Oy will give precise predictions for these 
masses. Since these gauge bosons are rather heavy, at present we have only 
seen their virtual effects in the low-energy charged- and neutral-current 
phenomenology. In the standard model, the Higgs particles are in the 
SU(2) x U(1) doublet representation, My and Mz are related (see eqn 
(11.82) and eqn (12.158) below) 


Mw 


oe ee 12.119 
M? cos? Ow l ) 


p 
This fixes the relative strengths of the charged- and neutral-current reactions. 
From the data on the neutrino neutral-current processes a two-parameter fit 
to the standard model yields (Kim et al. 1981) 


p = 0.998 + 0.050 (12.120) 
sin? Oy = 0.224 + 0.015. (12.121) 


The fact that p is very close to one lends support to the standard model. Of 
course this does not exclude the possibility that there can be more than one 
Higgs doublet or that there can be other representations of Higgs particles 
with small vacuum expectation values (except singlet Higgs scalars which do 
not couple to the W or Z). 

If we restrict ourselves to p = 1, an average of the neutrino data yields 


sin? Ow = 0.227 + 0.010. (12.122) 
Then the intermediate vector boson masses are 
Mw = 78.5 + 1.7 GeV 
Mz = 89.3 + 2 GeV. (12.123) 


The fact that these masses are confined to a rather narrow range provides us 
with a clean experimental test of the standard model. 

A more precise determination of sin? Ow and the gauge boson masses 
requires the inclusion of radiative corrections of the weak processes involved 
and of the energy dependence of coupling constants (see §3.3). Since the 
Weinberg angle is defined through the coupling constant (eqn (11.80)), it will 
also depend on energy. After radiative corrections, the value of sin? 0y at Mw 
is found to be 


sin? Oy(My) = 0.215 + 0.10 + 0.004 (12.124) 
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where +0.004 reflects the theoretical uncertainties in the radiative correction 
calculations (Marciano and Sirlin 1980; Sirlin and Marciano 1981; Llewellyn 
Smith and Wheater 1981). If one uses the fine structure constant at My 


—___ = 127.49, (12.125) 
a(Mw) 
the gauge boson masses come out to be 
38.5 GeV 
= ——_———~ = 83.0 + 2.4 GeV 12.126 
w= sin Oy(Mw) ae eine) 
Mz = M,,/cos Oy = 93.8 + 2.0 GeV. (12.127) 


Thus the higher-order effects increase the Mw and M;, values by about 5 per 
cent. It is anticipated that Mz will be measured to within 0.1 to 0.2 GeV in the 
high-energy e*e7 machine, thereby probing the higher-order electroweak 
radiative corrections. 

We now discuss W and Z decays. We shall see that the lifetime of these 
particles will be very short as they decay ‘semi-weakly’. 


W decays 
The couplings of W to the fermions are given by eqns (11.86) and (11.121) 


e 
Ly x aN (Ve; Vis vy" (d 7 Ys) H 
T 
d 
+ (0,6 Dy —y)UL s | [4 he. (12.128) 
b 


where U is the Cabibbo—Kobayashi-Maskawa matrix, (12.39). Using 
(12.128) we can calculate the rate for various decay modes. 
Consider the example 


W (k) > e(p) + W(q). (12.129) 
The matrix element is given by 


g 
(an 
e 2/2 


where e”(k) is the polarization vector of the W. Summing over the spins of the 
fermions and averaging over the W polarizations, we have 


U(P)? (1 — ys)v(q)e"(k) (12.130) 


l g? o kk 
=X ITS = [Pady + Prga — InP DI LI + za 
3 3 My 


spin 


g°M3, (12.131) 
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where we have neglected the electron mass. The decay rate for W at rest is 
d3 p d3 q 


1 l 
L= sae (E-D E T 


2Mw 


3 
a = ee (12.132) 


This decay rate is proportional to the Fermi constant Gp, rather than to G3 
as in the usual weak decays; hence we say it is a semi-weak decay. 

Clearly all other leptonic decays W~ > pv, and W~ —> vv, will have the 
same decay rate: I, = T, = IT.. For the hadronic decays, we can calculate the 
rate of decay of W into a quark pair which then hadronizes with unit 
probability 


(W > 1,f,) = 3|U, 70. (12.133) 


where n; = d, s, b and p; = u, c, t, with I’, as given in eqn (12.132). The factor 
of 3 in front is due to the colour degree of freedom of quarks. W > n;P; 
represents all decays into hadrons having the same quantum number as 
n; + p;. For the light quarks decay products we expect them to have a two- 
jet structure, modified occasionally by the emission of a gluon jet. The total 
decay rate can be simplified by using the unitarity property of the U-matrix 
È; |U; l? = £; |U; = 1 (assuming m, « My) 


Ea = 3I. F > 3U; T. = 127. 
i,j 


2 GrMÌ 
Z —— = 5.23(My in GeV)? x 1076 GeV. (12.134) 


Thus for My = 83 GeV, the total width is 
IT = 2.99 GeV (12.135) 
with the leptonic branching ratios 


_T(W > ev.) 1 


E a A A S -B =B.. 12.136 
° T(W >al 12 AR : 


u 
The large width of (12.135) reflects both that W decays semiweakly and that 
My, is large, so that there is plenty phase space. 


Remarks 

(1) In the standard model we have three families of leptons and quarks. If 
we generalize it to N families and if all of them are much lighter than Mw, the 
total width and leptonic branching ratio are given by 


2 NGM} 
r nT, oe 
3n 
1 
B. = —. 12.137 
«= aN ( ) 
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(2) In the calculation of the hadronic decays we assumed that quark 
masses could be neglected. This may be a poor approximation for decay 
modes involving the t-quark if m, turns out to be substantially larger than the 
present experimental lower bound, ~ 20 GeV. 

(3) Careful studies of the hadronic decay modes can yield useful 
information about the mixing matrix U,,. 

(4) It is clear from the Feynman rule that even if the Higgs mass 
my < My, W cannot decay directly into the Higgs p° through the lowest-order 
Lagrangian. An interesting possibility of W — °W, involving a second- 


order diagram is shown in Fig. 12.10(a). At first one may expect that in 


Pe aes, U-? 
WK) Py Z 7 


Fic. 12.10. 


addition to the extra coupling g there will be further suppression coming 
from the W-propagator ~M,”. But the WW6o coupling has a factor My; 
this and an additional factor coming from the three-body phase space will 
cancel the W-propagator suppression. Hence we expect this decay rate to be 
only down by an order of g? compared to W > ev, and this could be useful in 
searching for the Higgs particle 6°. The matrix element is given by 


go’ My 1 
I's = 2. M2 
2/2 (p+q) — M~w 


Summing over fermion spins and averaging over the polarization of W, we 
get 


ü(p)y, (l — ys)v(q)e". (12.138) 


4 2 
Sipe 9 E E E 
(12.139) 
The decay rate is given by 
4 s4 = 
C= My [on on(k — q) 
d?/ d? 

à SA T]. (12.140) 


* Ca) (2)°2po Ere 2qo 3 
Using the formula 


F dp d°q 
re 6°(Q — p — q)p*q’ 
Wo 240 


y 2 a a 
a7 (Q°g® + 200°) (12.141) 
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with O = k — l, we get 


Vy oy 2 Í ami 


= 72 6n 16n? | |My 


23 
ee 2.142 
aR a (aa 


where we have made the approximations m, = 0 and m, « Mw. Comparing 
this to W —> ev., we have 
(WT > pev) _ a 2m, 
T(W- > ev.) 4r sin? Oy 


$ al (12.143) 


This indicates that if ° is much lighter than W, the logarithm factor can 
give an enhancement. Experimentally this decay mode into b° will be difficult 
to observe because of the missing neutrinos and the short lifetime of the °. 


Z decays 


If we parametrize the coupling of Z? to any fermion f as 


Gey: 1 
Lyc = mà) u (g\ = gays)fZ" (12.144) 


then the width for each decay model is given by 


G-M3 
24r ./2 


I(Z > ff) = (IgV? + |gal?). (12.145) 


In the standard model, we have 


gy =Gv =gv = —14+ 4 sin’ Oy, Gr=Gro=GQr,=—l 
I=] = gy = 1 — 3 sin? Oy, ga=ga =g =l 
gy =g = g= —l +$sin? ðw, gk=gi=gi=—1 (12.146) 


and gy = ga = 1 for all neutrino flavours. Eqn (12.146) is obtained from eqn 
(11.95) with gy = 2(g, + gr) and ga = 2(g, — gr) and universality. We 
note that with the experimental value (eqn (12.124)) of sin? 0w = 0.215, 
which is close to 1/4, the vector coupling of the charged lepton gy = 0.14 is 
small compared to the axial-vector coupling g, = 1. The partial width for 
decay into a neutrino pair of a particular flavour f is 


_ GM2 
eee a S 
= 2.2(M; in GeV)? x 1077 GeV. (12.147) 


For the Mz value in eqn (12.127), this gives 
Py, = 0.18 GeV. (12.148) 


The relative widths of Z — ff for fin one given family can be just read off 
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from (12.146) 
I(Z > wy): T(Z > 12):T(Z > uit): T(Z > dd) 
= 2:[1+ (1 — 4 sin? 6,)7]:3[1 + (1 — $ sin? 6,)7] 
:3[1 + (1 — $sin? Oy)7] 
= 2:1.02:3.54:4.53. (12.149) 
The total width is 
Tio = F(Z > all) = 24(1 — 2 sin? Ow + $ sint 6,)I,, 
= 3.0 GeV. (12.150) 
The branching ratio for decay into a pp pair is 
T(Zoptp) 1 ¢ — 4 sin? Ow + 8 sint a) 
r(Z >al)  24\1 — 2 sin? Oy + £ sint 6, 
= 3.06 x 107?. (12.151) 


In eqns (12.149)-(12.151) we used the (12.124) value of sin? Oy = 0.215. The 
branching ratio of (12.151) is not very sensitive to sin? Ow for sin? Ow around 
0.22. For example, as we vary sin? 0w from 0.2 to 0.25, B(T) goes from 
3.06 to 3.13 per cent. This will be a useful piece of information in our search 
for the hadronic-produced Zs. 


Remarks 


(1) If we generalize the standard model to N fermion families, the results 
in (12.150) and (12.151) become 


I'(Z > all) = 1.0 x N GeV 
0.092 
Biutu-) = ———. 
(up) N 


(2) While the decay Z > °° is forbidden by angular momentum and 
Bose statistics, we have the interesting decay model Z > buu (Bjorken 
1977). This is analogous to W > $°l¥, as in Fig. 12.10(b). Similarly to eqn 
(12.143), we obtain 


(Zp o’) | a | 2m 


(12.152) 


+a (12.153) 


I(Z > p'u) 4msin? Oy cos? 0w)  |My| 24 


Combining eqns (12.153) and (12.151) we get a branching ratio for this decay 
of order 10° * for mẹ = 10 GeV. The possibility of detecting this decay mode 
will be much more favourable when compared to W > ¢°ev because one can 
observe both .* and p~ and p° should show up as a bump in the missing 
mass plot. Also in the future ee~ colliding machines it is expected that 
10° Zs can be produced in a year, so that even decays with small branching 
ratios can be detected. 

(3) Since the Zs will be produced in large quantities, one might hope to 
study the rare decay modes of Z which involve loop diagrams. Those 
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reactions with diagrams involving the trilinear gauge boson couplings will be 
of particular interest in checking the non-Abelian character of the theory. 
Most of the rare Z decays have been studied and are found to be still too 
small to be seen in the near-future (Albert et a/. 1980). For example, suppose 
we wish to study a flavour-changing neutral-current decay such as Z —> uC in 
Fig. 12.11. Because of the GIM cancellation mechanism, the amplitude is of 
order 


2 
T~ ap) Uu UžmÈ (M3 | (12.154) 
which gives the branching ratio 
T(Z u 2/ mX 
a E (a lal (12.155) 
I (Z > cc) n) \My 


which is of order 107? even for m; as large as 10 GeV. 


Fic. 12.11. One-loop diagrams for Z — cu with j = d, s, b. 


Remarks on W and Z production 


(1) We will just mention that the cleanest way of producing W and Z is 
through e*e™ annihilations. The Z boson will show up as a sharp spike in an 
ete collision very much like the J/y or Y particles. At the peak of Z 
resonance we have 

o(e*e7 > Z? > all) 


Ree ee ~ 5000 12.156 
C o(ete” > y* > tp) ! l 


which corresponds to five events per second if the luminosity is 
10°? cm~? s7 +. We may also note that, reversing the situation encountered in 
the case of the J/y and Y particles, the width of the Z boson (=2-3 GeV) is 
much larger than the beam resolution (~100 MeV). So the shape of the 
resonance will allow us to deduce Tẹ directly. From this one can ‘count’ the 
number of neutrino flavours N, using the relation 


Piot E I visible T N I (Z Bá VeVp) (12.157) 


since we know the value of I(Z—v,v,) (eqn (12.148)) and Ty. iS 
determined by the observable cross-section at the resonance peak which, 
according to the Breit-Wigner resonance formula, should be proportional to 
Tere: ) x Duos X Tio - 

(2) Ifthe ee~ machine has Ys > 2My,, the Ws will be produced through 
the diagrams of Fig. 12.12. This e*e7 — W*W°™ reaction is of particular 
interest because it involves the trilinear gauge boson couplings and will 
provide an important test of the non-Abelian nature of the underlying gauge 
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theories (Sushkov, Flambaum, and Khriplovich 1975; Alles, Boyer, and 
Buras 1977). 

(3) In practical terms it is likely that the first production of W and Z will 
be in the proton—antiproton collisions through the Drell-Yan mechanism 
(Fig. 12.13). For a review see Quigg (1977). 


e wW 
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FIG. 12.13. 


12.4 The Higgs particle 


In the standard electroweak theory, we start out with a doublet of complex 
scalars. After spontaneous symmetry breaking three of the original four real 
scalar fields are eaten by the gauge particles and we are left with one neutral 
physical Higgs scalar °. (For reviews of Higgs phenomenology see, for 
example, Ellis et al. 1976a,b; Li 1980). 


General properties of the Higgs particle 


The Higgs scalar of the minimal Weinberg—Salam model has the following 
basic properties. 


My = M, cos ẹṣ. As we have mentioned before (see, for example, eqns 
(11.82)-(11.84)) this relation follows from the doublet structure of Higgs 
particles and is well satisfied experimentally. Still, one cannot exclude the 
possibility that there can be more than one doublet in the theory. In fact there 
are other Higgs structures which also satisfy this relation (Tsao 1980). For a 
general SU(2) x U(1) multiplet of Higgs particle ġy y with weak isospin T 
and weak hypercharge Y, the p parameter is given by (Lee 1972c) 


Mw B Zr y ltr, y [T(T + 1) — ¥7/4] 


So e 12.158 
M2 cos? Ow 22rb YA ( ) 


p 
where vr y = (Old; y|0 is the VEV of the Higgs particle. The requirement of 
p = 1 for arbitrary vr y means 

T(T+1)=2Y?". (12.159) 
Examples of solutions to (12.159) are (T, Y) = (, 1), (3, 4), (8, 15), .... 
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Higgs couplings to fermions. The Yukawa coupling of the Higgs particle 
þ° conserves parity and fermion flavour, and its strength is proportional to 
the fermion mass. This can be seen as follows. We have the Yukawa coupling 


Ly =f Wi own + h.c. (12.160) 


where i, j are fermion flavour indices and the prime over the fermion fields 
indicates that they are weak eigenstates. The symmetry-breaking condition 


(eqns (11.63) and (11.90)) 
1 /0 
Po = 2 (o) 


v = (u? /A) = 27 14GF "2 ~ 250 GeV (12.161) 


with 


generates the fermion mass matrix 


Lu = My P aWir + h.c. with m;; Sij- (12.162) 


U 
= 
In this case, the fermion mass matrix is proportional to the Yukawa coupling 
matrix. Thus when we diagonalize the mass matrix 


Lu = Mij ‘LYR H h.c. = mWiaWir + h.c. (12.163) 


with y; being the mass eigenstates, we get diagonalized Yukawa couplings of 
the physical Higgs ° to the fermion fields, 


Ly = 24 /G, mo WaWin + hic. (12.164) 


which conserves fermion flavour and parity, and the strength of the coupling 
is proportional to the fermion mass. This property that mass and coupling 
matrices are proportional to each other is a consequence of the Higgs 
particles being in a single irreducible representation (Glashow and Weinberg 
1977). 

The experimental consequence of the proportionality of the coupling 
strength to the fermion mass is that the Higgs particle @° can be produced 
more easily by heavy fermions and will decay predominantly into the heavy 
fermion channels that are allowed by kinematics. The factor ./Gp in the 
Yukawa coupling (12.164) makes these coupling to the known fermions very 
small. Also the fact that these couplings conserve fermion flavours means 
that we cannot find their signature in the rare, but distinctive, flavour- 
changing processes. 


Higgs couplings to gauge bosons. These couplings, being proportional to 
gM, where M, is the mass of the gauge bosons W or Z, are much larger than 
the Yukawa couplings studied above. 


1 
Pow = (My WEWE + — maz,z") (12.165) 


2 cos Ow 


The quartic þ?VV couplings are listed in the Appendix. 
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Mass of the Higgs particle 


In the standard model the mass of the physical Higgs scalar particle is given 
by eqn (11.72) 


My = (2u)? = (2A)?v. 


While we know something about v, there is at present no information on the 
quartic coupling A. This lack of a precise knowledge of the Higgs mass makes 
it very difficult to search for 6° experimentally. Theoretically, there are some 
prejudices as to the range of my. If we require À to be less than 1 so that 
perturbation theory remains valid, we get from (12.161) an upper bound 


my < 350 GeV. (12.166) 


On the other hand, if å is too small, the symmetry-breaking vacuum will be 
unstable. This produces a lower bound as we will demonstrate below. 


The Linde—-Weinberg bound. The basic idea is that, if A is too small, the 
one-loop contributions (particularly those from the gauge-boson loop) to the 
effective potential in §6.4 become relatively important; they cause V(<o> # 0) 
to be greater than V(<o> = 0) and SSB disappears (Linde 1976; Weinberg 
1976a). In order to present the actual calculation we briefly summarize the 
results obtained in §6.4 and their generalization to the present case of SU(2) 
x U(1) gauge theory (Coleman and Weinberg 1973). 

The basic calculation in §6.4 involves summations of an infinite number of 
one-loop diagrams (Fig. 6.12) with scalar, fermion, and gauge bosons 
running in the loops. It is clear that the non-Abelian nature of the theory is 
not particularly relevant as gauge-field trilinear and quartic self-couplings do 
not play a role. So we can simply take over the results of §6.4 (see in 
particular eqn (6.164)). In order to make sure that differences in definitions 
and normalizations of couplings and fields are taken into account, let us first 
concentrate on the SU(2) gauge-boson loop. If we decompose the complex 
scalars in terms of the real fields as 


_ 1 fps tipa), 
“x2 ( + ~ a 
then 
l 4 
b? = o'b = 5 = b? (12.168) 
with 
{O79 = 24016 = w. (12.169) 


Since the effective potential can depend only on $? it will be adequate for our 
purpose to explicitly calculate only loop diagrams with external o,s. Since 
the gauge boson-scalar coupling is of the form $'(A,:A,)o, three gauge 
bosons will contribute equally. For each one, say A}, we can simply take over 
the scalar QED result of eqn (6.157). Keeping in mind that the , 28s in eqn 
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(6.154) are normalized differently from our o,s and using the Feynman rules 
in Appendix B, we need to replace e? by g?/4; we have 


3 2\2 
VOD = ga (F) otingyar +. 


or 


V(b?) = 5 i M4o*In62/M?2 +... (12.170) 


6n7v* 


where we have used MẸ = $g7v” and eqn (12.168). When the other gauge 
boson loops are included we have 


V (7) = gana Dio In @?/M?2 +... (12.171) 


The index V runs over the W* and Z vector bosons. Including the scalar 
and fermion loops, we finally obtain the one-loop effective potential similar 
to eqn (6.164) 


V (7) = Cot In ?7/M? (12.172) 
with 


l 
C= sea (3E mt + ms — 4E mi) (12.173) 


We should recall that the factor of 3 in the vector boson term comes from 
tracing the numerator of the gauge-boson propagator in the Landau gauge; 
the factor of 4 in the fermion term comes from tracing the Dirac matrices and 
the minus sign reflects the Fermi statistics. With 


V(b) = Volp) + Vi@) 
= —ph? + Ad* + Cot In (?/M?), (12.174) 


we can determine v by 


ae = 0 (12.175) 
Op b=v//2 
—p? + Av? + Cv?(In v?/2M? + 4) = 0. (12.176) 


The mass of the Higgs particle is given by 


67V 2 3 
i = z| a + c(in( + 5) | (12.177) 
b=e/ y2 


ae aes 
My 2 Ob? 
where we have used eqn (12.176) to eliminate the u? term. 
Now consider the case where 4 is very small so that it can be neglected in 
eqns (12.174)-(12.177). The value of the potential at the minimum ọ = v/ V2 
1S 


Pe ee eee 2 
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At = 0, the value of the potential is 
V(0) = 0, (12.179) 


when there is no SSB. Since we want (12.178) to be the absolute minimum, we 
must have 


Viv) < V(O). (12.180) 
This implies, through eqns (12.176) and (12.178), that 
In(v?/2M*)+1>0. (12.181) 


When we substitute condition (12.181) into (12.177) we have 


3a? (2 + sec* Ow) 


—— 12.182 
16./2 Gr sin* Oy ( ) 


Íi 
M e C a a 


where we have neglected m, and m, which are assumed to be small compared 
to my. For sin? Ow ~ 0.215, this gives 


my > 7.9 GeV. (12.183) 


Otherwise the radiative correction will make the asymmetric vacuum 
unstable. 


The Coleman—Weinberg conjecture. An interesting suggestion is that 
u? =0 in the standard model and SSB is driven completely by quantum 
radiative corrections (Coleman and Weinberg 1973; Weinberg 1979b). From 
eqns (12.176) and (12.177) we obtain 


my = (2C)Żv ~ 11 GeV. (12.184) 


Even though this is a precise prediction about m,, one must keep in mind that 
at this stage there is no compelling physical ground to have u? = 0 although it 
is an intriguing proposition with its suggestive simplicity. 


Production of the Higgs particle 


It is clear from the discussions on Higgs couplings to fermions and gauge 
bosons above that most of the promising mechanisms for producing the 
Higgs particles make use of their couplings to 


(1) heavy fermions; 
(2) gauge bosons, 


as they are less suppressed. 
In category (1) perhaps the most promising reaction 1s 


(a) Veg > O° + 9 


(Fig. 12.14) where Voo is the 1” QQ bound state of heavy quarks (Wilczek 
1977). Besides having a large fermion—Higgs coupling it has the additional 
advantage that the hadronic decay of quarkonium Vog is suppressed by the 
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Zweig rule. The decay rate has been estimated to give 


Vao > O° +) | GEM ms 
TVog > y > e*e7) 4/2 20 M? 


where My is the mass of Vogo. For My ~ 30 GeV and m, « My this ratio is 
about 8 per cent. One important feature of this mechanism is that the 
reaction produces a monochromatic photon and it may be a good 
experimental handle for its detection. 


(12.185) 


Q 
EE 


FIG. 12.14. 


In category (2), there are several interesting processes. 
(b) Z° >° + 
(see Fig. 12.15), where the f in the second diagram is some heavy fermion 
which has a large coupling to 6° (Cahn, Chanowitz, and Fleishon 1979). It 
turns out that for m; < My, the dominant contribution comes from the W- 
loop and the branching ratio is estimated to be 107° for mẹ « Mz. The photon 
coming from this decay is also monochromatic 


—-—# ——-¢° 


Fic. 12.15. 


(c)e’ +e + Z°+ 0° 
(Fig. 12.16). At ~s ~ 200 GeV even a Higgs particle of mass close to 
100 GeV could be produced with a cross-section larger than 10777 cm?. This 


corresponds to 1 event per day at a luminosity of 10°? cm? s~' (Ellis er al. 
1976a). 


Fic. 12.16. 


(d) Z => p? +e* +e. 


This decay has already been discussed in connection with Z decays in §12.3. 
The general feature is that b°? will show up as a mass bump in the recoil 


400 Standard electroweak theory II: phenomenological implications 12.4 


spectrum against the ete” pair. The estimate gives B(Z° > b°e*e ) = 107° 
for mẹ < 40 GeV. The related process W7 — $°ev is also calculated in 
§12.3; however this W-decay mode is rather difficult to detect. 


Charged Higgs particles 


In the standard theory, there is no physical charged Higgs particle. But in 
many of the extensions of the minimal model there are charged Higgs 
particles, hence a richer phenomenology (Donoghue and Li 1979; Golowich 
and Yang 1979). Even though their masses and couplings are not very 
constrained by the theory, if they exist, they can be produced in e*e7 
annihilations through photon exchange (Fig. 12.17). For energy well above 


Fic. 12.17. 


the threshold, the cross-section o(e*e” — o* 7 ) is 1/4 of the standard point- 
like cross-section o(e*e” — u* u`). Thus if there are several singly charged 
or doubly charged Higgs particles, their contributions to the e*e™ total 
cross-section should be significant enough to be observable. 

Also the charged Higgs particles will generate scalar or pseudo-scalar 
charged currents (McWilliams and Li 1981) which might contribute to low- 
energy charged-current interactions such as p > evy, n* > n°e*v, n > pey, 
m—>ev..., etc. Future high-precision measurements on these low-energy 
processes could shed some light on the properties of these charged Higgs 
particles. Any significant deviation from a V, A type of structure might 
indicate their existence. 


13 Selected topics in quantum 
flavourdynamics 


By the ‘standard model’ of strong and electroweak interactions one usually 
means the SU(3) x SU(2) x U(1) gauge theory with the SU(2) x U(1) 
electroweak group spontaneously broken down to U(1),,, by one doublet of 
the elementary Higgs scalar fields. The neutrinos are massless, There are 
three families of fermions and CP violations result from a complex CKM 
mixing matrix in the weak charged-current couplings. 

Several aspects of the standard electroweak theory may be revised and 
extended for various reasons. In §13.1 we discuss one particular realization of 
dynamic symmetry breaking based on an analogue to the QCD gauge theory of 
colour. Such schemes are referred to in the literature as technicolour models or 
hypercolour models and the purpose is to remove the elementary scalar field in 
the theory. In §13.2 the possibility of massive neutrinos is explored. We discuss 
the likely origin of neutrino mass terms and their phenomenological 
implications. We give details of a calculation (in §13.3) of u —> ey as a higher- 
order weak effect when there are massive neutrinos. The purpose is to illustrate 
the use of R.--gauge Feynman rules in one nontrivial case where the unphysical 
‘would-be-Goldstone’ bosons play an important role in maintaining the gauge 
invariance of the calculation. 


13.1 Dynamical symmetry breaking and technicolour models 


One attempts to replace the elementary Higgs scalars with composite ones 
(see, for example, Weinberg 1976b). A notable class of such models have the 
constituent fermions bound through gauge interactions that are modelled 
after the QCD theory of colour (Susskind 1979; Weinberg 1979a). Here one 
postulates a set of new gauge charges: the technicolours. One’s aim is to have 
a spontaneous symmetry-breaking theory with gauge interactions alone: 
there is no elementary scalar with its self-couplings and Yukawa couplings. 
Such a successful theory has by no means yet been constructed. However 
progress has been made towards the realization of such a programme. (For 
reviews see Beg 1980; Farhi and Susskind 1981.) Our purpose here is to 
illustrate the possibility of dynamical models of spontaneous symmetry 
breaking. We follow the presentation by Sikivie (1982). 


The motivation to replace the elementary Higgs scalars 


Higgs scalars are used in gauge theories to cause spontaneous symmetry 
breakdown. In the electroweak theory they generate masses for W and Z 


402 Selected topics in quantum flavourdynamics 13.1 


gauge bosons. For one doublet of elementary Higgs particles, one gets 
the correct relative strength between neutral and charged currents: 
My = Mz cos w. In the standard model the leptons and quarks also 
acquire their masses during SSB through Yukawa couplings. The different 
sizes of the fermion masses can easily be accommodated by having different 
sizes of couplings and the complex Yukawa couplings can give rise to CP- 
violating charged-current couplings through the diagonalization of the fermion 
mass matrices (see §§11.3 and 12.2). 

This ‘versatility’ of the elementary Higgs particle is related to the freedom 
one has in choosing the Higgs couplings: the scalar self-couplings and 
Yukawa couplings are quite unconstrained so long as they satisfy the 
requirements of gauge invariance. As a result a gauge theory with elementary 
Higgs scalars has many arbitrary parameters associated with the Higgs fields. 
This translates into the fact that in general masses and mixing angles cannot be 
calculated and must be introduced as parameters into the theory. 

Furthermore as we shall seein Chapter 14 0n grand unification that, when the 
standardSU(3) x SU(2) x U(1)modelis embedded into asimple gauge group, 
the above-mentioned arbitrariness is not muchimproved. On the other hand we 
have (see §14.2) the acute ‘gauge hierarchy problem’, which can be described as 
follows. Grand unified theories with group G require at least two stages of SSB 
corresponding to G > SU(3), x SU(2), x U(1) at energy scale M, and SU(3), 
x SU(2), x U(1) > SU(3), x U(1),,, at scale u. The scale pis fixed by the weak 
interaction strength to be 250 GeV (see eqn (11.90)). The scale M is expected to 
be of order 10!* GeV both as a result of the lower bound on the proton lifetime 
and because of the successful prediction of sin? Ow. If these SSBs are due to 
elementary Higgs scalars, the ratio u?/M? ~ 107 * has to be introduced by 
hand and readjusted to 24 decimal places in each order of perturbation 
theory (see §14.2). 

This motivates us to investigate the possibility of SSB without having to 
introduce elementary scalar fields. The notion of a composite Higgs scalar is 
really not a new one. We have already mentioned in §8.3 that the idea of the 
Higgs phenomena was first suggested by the theory of superconductivity. 
There the electromagnetic gauge symmetry is spontaneously broken by the 
condensate (i.e. non-vanishing ground-state expectation value) of the 
electron pairs (the Cooper pairs), which acts as an effective composite Higgs 
scalar. Thus SSB is brought on dynamically through the interactions of the 
electrons with the lattice phonons. (For an early work on dynamical 
symmetry breaking see Nambu and Jona-Lasinio 196la,b.) Therefore the 
question one faces here in the electroweak case is that, if Higgs scalars are 
composite, what are their constituents? What interactions are responsible for 
binding them together? How can we have all the desired patterns of SSB 
without introducing many arbitrary parameters? 


The basic technicolour idea 


One naturally wonders whether the QCD strong force which binds the 
coloured quarks can be the interaction responsible for SSB in the electro- 
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weak interaction. We shall see that, although it fails to fulful this role, the 
way it fails suggests possible candidate theories. 

Consider the standard SU(3), x SU(2), x U(1)y model, but—this time— 
without the elementary Higgs scalar. Also, for simplicity, let us restrict 
ourselves to one family of fermions 


u V 
qı = i li = (") (13.1) 


The Lagrangian, for indices « = 1,..., 8; i = 1, 2, 3, is then 
T= —4G""G?, — AnA — 4B" B, 
+ i(qy,D"q + ly D”). (13.2) 


Since there is no Higgs VEV to break the SU(2) x U(1) gauge symmetry, it 
would seem that all fermions and all gauge bosons, including W and Z, will 
remain massless. As we shall presently see, this is actually not the case. 

Let us for the moment turn off the electroweak interaction and remember 
from Chapter 10 some of the basic features of the strong interaction as 
described by QCD. The fact that the u and d quarks are massless implies that 
we have the flavour symmetry SU(2), x SU(2)g. From the discussions in 
Chapter 5 on chiral symmetry and its breaking, all evidence is consistent with 
the picture that this symmetry is realized in the Goldstone mode. The 
symmetry is spontaneously broken with the vacuum being invariant only 
under the diagonal subgroup SU(2), .p 


(iu) = <dd> # 0 (13.3) 


and there are three (exactly) massless Goldstone bosons n*°. Although this 
result has not been proven rigorously in QCD, all indications are compatible 
with this expectation. 

To express the above in a more suggestive notation we define the effective 
scalar and pseudoscalar fields (c, n) having the quantum numbers of quark 
bilinears 


o~qq and r ~ 1qty,q (13.4) 


where q = (4). Thus we have just the SU(2) x SU(2) o-model considered in 
§5.3. The SSB condition (eqn (5.169)) and a generalization of eqn (5.155) 
immediately imply that the magnitude of SSB in (13.3) is given by the pion 
decay constant f, 


[0o = v X95 MeV (13.5) 


namely 
1 
xp {dQ>o = fa = 95 MeV. (13.6) 


fain turn must be related to the QCD scale parameter A, ~ 200 MeV as it is 
the only scale in our theory. 
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Now we turn on the electroweak interaction. From (13.1) the quark 


bilinears transform as 
T, + int, 
£ 13.7 
Pet (" + in; eo) 


The VEV <o> =f, then breaks the SU(2) x U(1) symmetry down to 
the electromagnetic U(1) with n being eaten by the three gauge bosons to 
become W+ and Z. From eqns (11.77) and (13.5) we have 


My = 4gf. ~ 30 MeV (13.8) 


which is about three orders of magnitude smaller than the value (80 GeV) 
required in the standard model. But this simple mechanism of dynamical 
symmetry breaking does obtain the correct relation between My and Mz. 
This is because we have an SU(2), +r symmetry remaining (i.e. the isospin of 
the strong interaction) which will give Mw = Mz cos Oy (i.e. p = 1) as 
explained in §11.2 (see especially eqn (11.83)). 

To see the above results more explicitly consider the vacuum polarization 
diagrams of Fig. 13.1; they are derivable from the Lagrangian in (13.2). 
From the AA diagram in Fig. 13.1 we have 


ni(k) = 3! (2) Ous — kk, k? )kPr(k?). (13.9) 


FIG. 13.1. Weak gauge boson vacuum polarization loops from the quark-gluon states. 


Summing up the bubbles, the propagator of the gauge boson A, is modified 
from 


Sï Guy ~~ kk, /k*) 
k? 


to 
ij (Juv ~ k,k,/k*) 


Se __ 13.10 
EL grk Ce 


Because chiral symmetry must be realized in the Goldstone mode QCD 
interaction implies a massless pole in the vacuum polarization bubble (see Fig. 
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13.2). This means that the vacuum polarization function in (13.9) is given by 
n(k*) = f2/k?. (13.11) 


Namely the contribution of the AA diagram in Fig. 13.1 to eqn (13.10) has a 
pole at 
k? =Min = g°f 3/4 


q 
A A A T A 
WO o Sr => MDL D9 YN 
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Similarly the AB diagram and the BB diagram in Fig. 13.1 have poles at 
Mis = 9917/4 
Min = 9'°f;/4. 
Comparing this to eqn (11.78) we immediately obtain eqn (13.8) with p = 1 
in (11.82). We should comment that if we had taken a three-family, six-quark 
flavour theory, the chiral symmetry would become SU(6) x SU(6) and there 
would be 35 Goldstone bosons. Three of them would become the longitu- 
dinal modes of W+ and Z; the remaining 32 would acquire very small masses 
in higher orders of electroweak interactions 1.e. they are pseudo-Goldstone 
bosons. 
In any case we see that, in the right circumstances, QCD itself breaks down 


the electroweak gauge group in just the right pattern. However it falls short 
of being a realistic possibility because 


(1) the scale is all wrong, we get My ~ 30 MeV instead of 80 GeV as 
required; 

(2) fermions remain massless. 

It is relatively straightforward to overcome problem (1): we postulate the 
existence of another QCD-like interaction, called the technicolour interaction 


(TC), which has a scale parameter Ay: such that it produces the phenomeno- 
logically correct mass for W 


Thus F, ~ 250 GeV and Ajc is of order 1 TeV. In other words the techni- 
colour interaction, with a gauge group SU(3) for example, is in every way 
similar to QCD except that it produces condensate (or VEV) at energy three 
orders of magnitude larger than QCD. Thus there are fermions that carry 


technicolours (the techniquarks Q) with SU(2), x U(1) x SU(3), x SU(3)tc 
transformation properties 


UL 
a ~ (2, 1/3, 1, 3) 


Up ~ (1, 4/3, 1, 3) 
Dp ~ (1, —2/3, 1, 3). 
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The familiar quarks and leptons are TC singlets. These technicolour quarks 
form bound states just like ordinary quarks under ordinary colour inter- 
actions. The TC chiral symmetry is also spontaneously broken with a 
magnitude given by the technipion decay constant 


i 
5772 <QQo =F, = 250 GeV. (13.13) 


Thus we expect a rich spectrum of new particles in the TeV range. 


Extended technicolour 


The picture as developed so far still does not solve problem (2)—massless 
fermions. Quarks and leptons have separate chiral symmetries which remain 
unbroken. This situation is to be compared with that in the standard model 
where Yukawa couplings of elementary Higgs scalar to fermions produces 
fermion ‘current masses’ (see eqn (11.71)) as soon as <o # 0. Thus we must 
find ways to produce effective Yukawa couplings between ordinary fermions 
and technimesons. One possible way to do this is to enlarge the technicolour 
group Gc to an extended technicolour gauge group Gegrc by placing 
technifermions F (having, say, three technicolours) and ordinary fermions f 
(technicolour singlet) in a single irreducible representation of Ggyc. 


a l 
Doga 


Extended technicolour (ETC) breaks down to technicolour at some energy 
scale u. The € vector gauge boson being in ETC but not in TC acquires mass 
Mg ~ gerchH, and couples to currents of the form Fy, f The effective four- 
fermion interactions mediated by € have the form 


l (Getc soe so 
(e) (FY a SL fr? FR) (13.14) 


By a Fierz transformation we obtain 


| Cesena = m 
~ 32 (FF)(ff) — (FysF)dysf) + ...] 


where we have used the relation M, = ggrcu. The condensation of techni- 
fermions <FF), # 0 then produces a mass for the ordinary fermions, 


a 
m= 5 (FF),. (13.15) 


Since (FF) ~ (1 TeV)°, one needs u ~ 30 TeV to produce m; ~ 1 GeV. 
As & vector bosons are SU(3), x SU(2), x U(1) singlets, we need a set of 
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technifermions for each ordinary fermion in order to give latter a mass. For 
one family of fermions, we need eight sets of technifermions, 


U,, U,, U3, D,, D}, D}, N, and E, 


where the subscripts are the QCD colour labels. The flavour symmetry of the 
TC interaction is then at least as large as SU(8), x SU(8)gk. When this chiral 
symmetry is spontaneously broken, three of the Goldstone bosons combine 
with Wt, W_, and Z and we are left with a large number of relatively 
light (on the TeV mass scale) pseudo-Goldstone bosons. This will allow for a 
possible early test of the TC approach to dynamical symmetry breaking. 


Tumbling 


To give masses to several families of ordinary fermions using one family of 
technifermions, we need to break the ETC gauge group down to TC in 
several successive stages. For example, a three-family model could be 
constructed by having the sequential breakdown 


SU(6)F™ > SU(5)FT > SUMET = SU™G) 
H 


with fermions 
F F F f” f° f) 
(= 
ETC 


TC 


transforming as a sextet of ETC. The first family f = {e, v, u, d} would have 
mass m; ~ (1/2u*)<FF>, the second family f’ = {p,v’,c,s} mass mp 
~ (1/2p'*)< FF), etc. 

Can we have such a sequential SSB without an elementary Higgs scalar? 
Does one need to introduce a new TC interaction to perform each successive 
ETC symmetry breakdown? One possible way to avoid this proliferation of 
TC gauge groups is the idea of ‘tumbling’. This is a hypothesis as to the 
behaviour of unbroken asymptotically-free gauge groups with non-real 
fermion representation content. We adopt the convention under which all 
fermions are described by left-handed fields. For example QCD with n 
flavours has fermions qı; and (q°),; = iy2gr, (i= 1,2,...,) and we say its 
fermion content is n(3 + 3*) of SU(3)°, which is real. It will be argued 
below that an asymptotically free gauge group with non-real fermion 
representation content breaks itself when the gauge coupling constants 
become large in the infrared region. Several successive breakings may 
occur before the fermion representation content become real under the 
unbroken subgroup, at which point the ‘tumbling’ stops. This allows us to 
establish a hierarchy of mass scales in an economical and natural way. 

We illustrate the tumbling scheme by comparing QCD to an example of 
the SU(5) gauge group with fermions in the 5* and 10 representations. In 
the tensor notation a 5* is denoted by w' and a 10 by the antisymmetric 
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Wi; = —W, with i,j = 1, 2,..., 5 (for more details, see §§4.3, 14.1, and 14.5). 
Possible scalar bound states composed of such fermion bilinears are 


5* x 5* = 10% + 15, 
10 x 10 = 5* + 45% + 50; 


5* x 10=5+ 45. (13.16) 
This is to be compared to the QCD case 
3x3=3* +6 
3* x 3* =3 + 6* 
3x 3*=1+8. (13.17) 


To have a Lorentz scalar the fermion bilinear must be a symmetric product. 
The bound state ¥,y, potential in the one-gauge-boson-exchange approxi- 
mation is given by 


V(r) = Gs dC eC) (13.18) 


where C, and C, are the Casimir operators of the constituents and C is that 
of the bound state. This corresponds to the familiar relation —2T, -T, = T? 
+ T$ — T? for T = T, + T, in isospin symmetry. We then identify the most 
attractive scalar channel (MASC) for which C, + C, — C is maximum. The 
basic proposition of the tumbling scheme states that, when we move from 
high energies to the low energy region where the running coupling becomes 
large, the MASC will condense first when 


a(uy(C, + C> — C) = 0(1). (13.19) 


This is clearly consistent with QCD where the MASC is the singlet 1 e 3 x 3*. 
For our SU(5) example, the MASC is the 5* contained in 10 x 10. We 
have 


COW We" |0> = WOms (13.20) 


i.e. the difference is that here the condensate is not a singlet and it must single 
out a direction in the SU(5) space and hence breaks the SU(5) gauge 
symmetry: SU(5) > SU(4) at scale u. The SU(5) gauge bosons that are not in 
SU(4) will pick up masses of order gu. The fermion multiplets split up 
according to 


5* = 4* +1 

10=6+ 4. 
Just as in QCD, the fermions that participate in the condensate acquire a 
dynamical mass, closely related to ‘constituent mass’ of §5.5; <ĝq)o = u’? 
implies a dynamical mass term ~ qq for quarks. For the SU(5) case it is the 


sextet in 10 (Yas = —¥,,, a=1,...,4) that picks up masses. These 
dynamical masses are sharply energy-dependent and disappear at an energy 
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scale above u. Below u the heavy particles decouple (Appelquist and 
Carazzone 1975); we have a theory based on the gauge group SU(4) with 
fermion representation content: 4+ 4* + 1. The gauge coupling constant 
now runs according to the SU(4) B-function. The new MASC is 4 and 4* 
combining into a singlet. That channel condenses when 


alu’ (C4 + Ca — Cy) = O(1). 


The SU(4) gauge group remains unbroken and the tumbling stops because 
the condensate is an SU(4) singlet. The 4 and 4* fermions acquire 
dynamical mass of order p’ which is the mass scale of the condensate of 
the SU(4) singlet. Only two mass scales are produced in this SU(5) example, but 
it is easy to construct examples that yield several more. 

What is envisaged is a theory of low-energy (<300 TeV) particle physics 
based on the gauge group 


SU(2), x U(1) x SU(3), x Gere. 


The ETC fermion representation content is complex. From u ~ 300 TeV, ETC 
tumbles down to TC at p’ which is a mass scale of a few TeV. Under TC the 
fermion representation content is real. At u” ~ 250 GeV, the technifermion 
condensates break-down—SU(2), x U(1) > U(1),,,. The quarks and leptons 
are TC singlets but are ETC multiplets in common with the techniquarks and 
technileptons. They acquire masses through an effective Yukawa coupling as in 
eqn (13.15). In principle this scheme allows one to determine the quark and 
lepton masses, the Cabibbo angles, the W* and Z° masses, all in terms ofa single 
parameter, the ETC gauge coupling constant at some given mass scale. In 
practice itis not easy tocarry out this programme. Particularly oneis concerned 
that some of the light pseudo-Goldstone bosons will induce quark (or lepton) 
flavour-changing neutral-current processes at much too largea rate. From our 
discussion it is also clear that in order to achieve the original goal of having a 
theory with very few adjustable parameters we must introducea large number of 
particles. Thus an economical and elegant theory still eludes us. 


13.2 Neutrino masses, mixings, and oscillations 


We have already mentioned in §11.3 that the reason why there are no 
Cabibbo-like mixing angles in the lepton sector of the standard electroweak 
theory is neutrino mass degeneracy (i.e. all vs have the same mass—zero). 
This degeneracy means that there is no need to diagonalize the neutrino mass 
matrix (in fact no mass matrix to begin with). The absence of physically 
significant mixing angles brings about a set of conservation laws for the 
lepton flavours: the electron number, muon number, and the t-lepton 
number. Processes such as u — ey are forbidden. If neutrinos are not strictly 
massless, what are the phenomenological implications of small neutrino 
masses? Besides a nonzero rate for u > ey (see calculation in the next 
section), we have the novel feature of neutrino oscillations. 
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Neutrino oscillations 


This means that a beam of neutrinos (produced through weak interaction 
decays, corresponding to some definite flavour) can spontaneously change, or 
oscillate, into neutrinos of different flavour, e.g. ve v,, while travelling in 
vacuum. This property may explain the ‘solar neutrino puzzle’ (Davis, 
Harmer, and Hoffman 1968; Bahcall et al. 1980). While neutrino masses and 
mixings were discussed earlier by Sakata and his collaborators (see, for 
example, Maki et al. 1962; Nakagawa et al. 1963), these possibilities, 
particularly in connection with neutrino oscillation, have been studied most 
persistently by Pontecorvo (1958; 1968; Gribov and Pontecorvo 1969; 
Bilenky and Pontecorvo 1978). 

If neutrinos are not massless, their mass matrix, just as in the case for 
quarks, will be nondiagonal and complex. One needs to transform it into a 
diagonal form by unitary rotations. Thus the mass eigenstates are different 
from gauge eigenstates 


Vi.) Oy; (13.21) 
a >; aivi 


where v, = Ve, v,, Vv, are weak eigenstates and v; = v;i, V2, v3 are mass 
eigenstates with mass eigenvalues m,, m,, and m,. U is a unitary matrix 
which can be parametrized like the KM matrix for quark mixing angles (see 
eqn (12.40)) 


Ve Ci Sit: $183 vi: 
va, P= | S12 C1C2C3 — S283 €? C4C383 + S2C3 e° H va | (13.22) 
v, — $452 C1S283 + C283 € C1S283 —c,c,e° J \ v, 


where c; = cos 0; and s; = sin 0;. Of course there is no reason whatsoever to 
expect that these angles are in anyway similar to the Cabibbo—Kobayashi-— 
Maskawa angles. 

If at time t = 0, a beam of pure v, states is produced, say by mn” > e*v, 
decays-in-flight, it is initially a superposition of mass eigenstates as 


lv.(0)> = cilv1> + $1C3|¥2> + 8483|V3>. (13.23) 
The time evolution of a state is controlled by its energy eigenvalues. We 


assume that all neutrinos in the beam have a common fixed momentum p; 
then the mass eigenstates have energy eigenvalue 


E? = p* +m? (13.24) 
and | | | 
lv.(t)> = c1 e75 lv Y + sics e7" iva) + 8483e°3|v3>. (13.25) 
The probability of finding a v, at time t is given by |<v,|v(t)>|?. So for example 
the probability of finding a v, is 
P, = (b = 1 — 2cjsje3[1 — cos(E, — Ez )A] 
— 2c?s?s3[1 — cos(E, — E,)t] — 2s{s3c3[1 — cos(E, — E3)t]. 
(13.26) 
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For p > m;, we have 


E, = (p° + m7)? ~ p + (m?/2p) 


and 
E; — E; = (m? — m3)/2p. (13.27) 
It is convenient to define the oscillation lengths 
2n 4np p (MeV) 
l; = ~ —— = 2.5 m| — 13.28 
Y“ E— E, |m} — m| Ea (eV)? l ) 


so that (13.26) may be stated as the probability of observing a v, at distance x 
from the source, 


2 2 
P, (x) = 1 — 2cîsîc} | 1 — cos Tx- 2c2s2s2 | 1 — cos sides 
i lio lis 
42.3 21x 
— 2s3s3Cc3| 1 — cos LIE (13.29) 
23 


For x > /;; the harmonics are smoothed off and only the average intensity 
will be observable 


<P, =v = 1 — 2cts? — 2sts3c3. (13.30) 


The smallest average value possible is ¢P,,.,.> = 1/3 corresponding to the 
special case s? = 2/3, s3 = 1/2. Similarly, 


<Prev = 2W{sje3 + 2s783c3(s3 — cjcz) + 2s{s.83c,C2C3 cos 6(s3 — c?) 


(Pav = 2cjs?s} + 2s{s3c3(c3 — cis?) + 2s¢s,s3c,C C3 cos ô(s3 — c2). 
(13.31) 


Thus, one can in principle measure the leptonic CP angle 6 through the 
transition rates of one neutrino to different neutrino flavours. 
For the more general case 


w=LUy, t21,2,.4,N (13.32) 
VD) = È} Uv, e, 


we can easily obtain the probability of finding flavour v, in a v, beam 


2 
CP ov) = È Ual?lUpl? + E UUU Up; cos (7). (13.33) 


ixj lij 


One can also show that the smallest possible value for the average probability 
(Py vy? = 1/N. 

The magic of this oscillation phenomenon is of course intimately related to 
the quantum mechanical measurement theory. This possibility of one 
neutrino flavour v, spontaneously changing into another flavour reflects the 
uncertainty of energy-momentum measurement in these processes. For 
example, if we can pin down precisely which mass eigenstate v; is produced at 
the source, the oscillation pattern is destroyed, as the precision of momentum 
measurements required to do this will precisely prevent locating the source to 
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an accuracy better than the oscillation length. Thus a more proper treatment 
should make use of wave packets, etc., but the same results are recovered 
(Kayser 1981). 

The principal point one should keep in mind is that in order to have 
neutrino oscillation we must have nonzero (and non-degenerate) neutrino 
masses and mixing angles. 

The question is then: can we have massive neutrinos in gauge theories? 
Before discussing the question of neutrino masses in SU(2) x U(1) electro- 
weak theory we will first study briefly some of the special properties of 
neutrino mass terms. 


Neutrino mass terms of Dirac and Majorana type 


We adopt the conventions with respect to charge-conjugation (C) and 
helicity-projection operations 


V = Cyt =iyry*, PW =Wic 


Wi = 4(1 — y5), Yr = 71+ 75). (13.34) 
We also use the notation 
WL = (WL) = 21 + 75) = (W)p. (13.35) 


The fermion mass terms connect left- and right-handed fields. A Dirac-type 
mass connects the L and R components of the same field, 


Ly = DW + Wav) = Dy. (13.36) 
Thus, the mass eigenstate is 
w= Wit We. (13.37) 


A Majorana-type mass connects the L and R components of conjugate fields. 
In the notation of (13.35), we can have 


Lun = AVL + PLYT) = Axx (13.38) 
Lup = BWW + Ure) = BO. (13.39) 
The mass eigenstates are then self-conjugate fields 
x=% +y, =X 
w = Yr + YR, w = W. (13.40) 
These can be inverted to yield 


YL=4(1— ys); WL=3(1+ys5)x 


Yr = 2(1 + y5)w; R = 2(1 — ys) (13.41) 
when the y, matrix is applied to the y, y, and œw fields 
Y y — YL + Wr 
yi X FH] x PH] hti] (13.42) 


wW co" +Wr—- Wr 
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Clearly this changes the sign of D, A, and B in eqns (13.36), (13.38), and 
(13.39). The fields Y’, y’, and w’ are interpreted as the correct mass eigenstates 
for the minus values of fermion masses. 

When both Dirac and Majorana mass terms are simultaneously present we 
have 


Lom = Di We + AWW + Bi RWe + hic. 
= įD(%w + ©%) + 4%% + Bow 


_ (A 3D\/x 
z 13.43 
(7, ô) ( ee \2) (13.43) 
which can be diagonalized to yield two mass eigenvalues 
M,,. =3{(A + B) + [(4 — BP + D*)""} (13.44) 


corresponding to the Majorana mass eigenstates 
nı = (cos 0)y — (sin 0)w 
n = (sin 0)y + (cos 0)w (13.45) 
with 
tan 20 = D/(A — B). (13.46) 
We can easily invert (13.44) and (13.46) and obtain 
A = M, cos? 0 + M, sin’ 0 
B = M, sin? 0 + M, cos? 0. (13.47) 
Thus the most general mass term (13.43) for a four-component fermion field 
actually describes two Majorana particles with distinctive masses. 
It is interesting to see that the usual four-component Dirac field formalism 
can be recovered in the limit of A = B = 0. When 0 = 2/4, we have mass 
eigenstates (y + w)/./2 corresponding to eigenvalues + D/2. To flip the sign 


of the negative mass we need to apply a chiral transformation as in (13.42). 
Thus the fields 


l 
Wo 


l 


a te +e) oi 


(x + œ) = 
and 
l 
Re 


have the same mass eigenvalue 4D. Because of this degeneracy we are free to 


use any new combinations of fields so long they represent a rotation in the 
(€,, č2) plane. Thus, 


Low(A = B = 0) = 3D(E,6, + €2€2) 
= D(E1E, + F564). (13.49) 


, a E C c) = 
&-0)=-zl WL + YL-— We + WR) =é (13.48) 
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For the particular linear combination (i.e. another 45° rotation) 


l 

o1 = 75 (G1 — 82) = We + Ue 
l 
J2 


it is obvious that (13.49) reduces to Dw with y = Y + Wr. 

Thus a Dirac fermion really corresponds to the degenerate limit of A = B 
= 0 in the more general case of two Majorana particles. Since the Majorana 
mass terms A (13.38) and B (13.39) violate the conservation of whatever 
additive quantum number that y carries, e.g. electric charge, all elementary 
fermions, except neutrinos, being charged must have A = B=0. For 
neutrinos, Majorana mass terms violate lepton number by two units. The 
presence of such Majorana neutrino masses leads, for instance, to neutrino- 
less double B-decays: (Z — 1) —> (Z + 1)+ e +e, or kaon decays such as 
K~ > nee. The quark diagram corresponding to these lepton-number 
nonconserving processes is shown in Fig. 13.3. (We do not consider theories 
with V + A charged currents coupled to heavy IVBs, Wg, where such an 
amplitude does not have to be proportional to a Majorana v mass term.) 


oo = a (61 + On) = YL + YR, (13.50) 


d 


Fic. 13.3. Quark diagram for neutrinoless double B-decays. The symbol @ corresponds to a 
Majorana-mass insertion v,v, + h.c. 


Possible neutrino mass sizes 


At present the principal evidence for a possible non-zero neutrino mass 
comes from a single (not yet corroborated) experiment in tritium B-decay, 
3H > He + e7 + ¥,. It is found that the shape of the electron spectrum 
near the end-point can be interpreted as giving an m,, ranging from 15 to 
45 eV (Lyubimov et al. 1980). All other laboratory results only yield upper 
bounds on the m,s 


m,, < 60 eV, 
m,, < 510 KeV, 
m,, < 250 MeV. (13.51) 


One can however obtain much more stringent bounds if one invokes 
cosmological theories. We shall give a brief account of this constraint. 
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Cosmological bound on neutrino masses. The standard model of big-bang 
cosmology is now generally accepted as being the correct theory of the 
universe. The principal ingredients are the cosmological principle and 
Einstein’s field equations. It describes a universe which is spatially homo- 
geneous and isotropic, and is expanding according to Hubble’s law 


V= HR (13.52) 


where V = velocity, R = distance, and H is the Hubble’s constant (~ 15 km/s/ 
million light years). Actually the basic features can be readily understood 
from Newtonian mechanics. Much like the notion of ‘escape velocity’, there 
is a critical mass density p, corresponding to the precise cancellation of 
kinetic and (gravitational) potential energies 


p. = 3H2/8nG ~ 5 x 1073° g/cm? (13.53) 


where G is Newton’s constant. If the mass density of the universe p < p, then 
it will continue to expand forever (the ‘open universe’); if p > p, then the 
expansion will slow down, eventually stop, and start contracting (the ‘closed 
universe’). The present bound on the total density of the universe, estimated 
from its age and deceleration, is 


p° S 4p, (~2 x 1077? g/cm?) (13.54) 


where the superscript zero denotes present time. On the other hand the 
observed galaxies and clusters can only account for a density p° that is at 
most a tenth of p.. One can then speculate about the nonluminous masses in 
the universe. 

The standard model of cosmology succeeds in providing a common 
meeting ground for a large variety of observational data. The matter of the 
universe is observed to reside primarily in the form of hydrogen atoms and a 
small portion in helium, and other light elements. If the galaxies are rushing 
apart from each other according to Hubble’s law, they should have been 
closer in the past, making up a universe that was smaller and hotter. It is 
argued that such a hot universe would have ‘cooked’ all the hydrogen into 
heavier elements. However the small amounts of heavy elements observed in 
the universe is consistent with our picture of their being produced later in the 
galaxies. Namely they are not primordial in origin and the cosmological 
evolution should be such that no heavy elements are synthesized in the early 
universe. This can be the case if there was an intense field of radiation which 
would blast apart the heavy elements as soon as they were formed. Such an 
electromagnetic radiation, which was once in thermal equilibrium with 
matter, should still be present today, red-shifted by Hubble expansion to 
become a low-temperature black-body spectrum of background photons. 
This background radiation was indeed discovered and was measured to have 
an equivalent temperature of about 2.7 K. (This value is just compatible with 
a nucleosynthesis calculation of the above-mentioned cosmological helium 
abundance.) Using the standard black-body radiation formula relating 
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number density n to temperature 


3 $0) 


n=T 
a 


(ge + 3gr) (13.55) 
with 


1 f x? dx l ] 
¢(3) = | ne ga. a 
0 
where gg, gp are the number of boson and fermion degrees of freedom, 
respectively (e.g. gg = 2 for a photon gas (2 helicities), ge = 2 for massless 
neutrinos, g-=4 for electrons, etc.), we have the present photon 
number density for T = 2.7°K 


n? ~ 400 cm~?. (13.56) 


Similarly, the standard big-bang model suggests that the universe is 
immersed in a sea of primordial neutrinos. Because the neutrinos went out of 
thermal equilibrium before ee~ annihilation heated up the background 
radiation, the present cosmic black-body neutrinos should have a lower 


temperature (Peebles 1966) 
Po A 
NN. ey cee 13.57 

(Fs) =i monn 


or T? ~ 1.9K. Combining eqns (13.55)-(13.57), we have neutrino number 
density 


i ek = 
n? ~ (3) n? ~ 110 cm~? (13.58) 
corresponding to a neutrino mass density, 
3 
poy i n?m; = 9. 2m;(eV) x 107°! g/cm? (13.59) 


where the sum is over neutrino mass eigenstates. The bound (13.54) can then 
be converted into a bound on the sum of neutrino masses (Gershtein and 
Zeldovich 1966; Cowsik and McClelland 1972; Szalay and Marx 1976) 


ym; < 100 eV. (13.60) 


It is amusing to speculate that the universe is ‘flat’ with a mass density of 
precisely p, and that the nonluminous masses all reside in the form of neutrino 
masses (i.e. ignoring all other possibilities such as magnetic monopoles, etc.); 
we then have 


ym, = 25eV. (13.61) 


l 


How can we have small neutrino masses? Fermion masses, whether coming 
from bare mass terms or Yukawa couplings (through Higgs mechanism), are 
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arbitrary parameters in gauge theories due to infinite mass and coupling 
renormalization. Hence they are generally not calculable and have to be 
determined experimentally. 

The neutrino masses, if nonzero, must be small when compared to all other 
mass scales. Theoretically one would like to have some way to understand 
their smallness. There are a few special cases where neutrino masses are not 
arbitrary and can be small. 

(1) The most obvious possibility is that the neutrinos’ mass terms are 
absent to the zeroth order and that higher-order radiative corrections give 
rise to masses (Georgi and Glashow 1973; Cheng and Li 1978). Consider the 
Dirac mass terms. If the particle content of the theory is such that there is 
neither the bare mass term Wr (i.e. Y, and Wp transform differently) nor the 
Higgs-generated mass term W,I*Wp<d*>_ (because o, is absent or <¢,)o 
= 0), then the diagram in Fig. 13.4 is finite as there are no possible 
counterterms to absorb the infinity. Such an m, in a theory where vp is 
assumed to transform nontrivially under the gauge group (and thus also to 
couple to the W-boson) should be calculable and is of the order g?m,. A 
similar situation is also possible for Majorana-type mass terms. 


W 
Vp f VR 
Fic. 13.4. 


(2) As we shall discuss in the next chapter, grand unified theories (GUT) 
of the strong and electroweak interactions demand the existence in one 
theory of two vastly different mass—energy scales 1, , corresponding to two 
stages of symmetry breaking 

Gour > SUG) x SU(2) x UT) 2 SUQG)cotour X UC )em. 
In certain situations this extremely small ratio u,/u; «< 1 (of order 107 '?) 
can be reflected in the fermion spectrum. The neutrino mass matrix of (13.43) 
may naturally have the form (Gell-Mann, Ramond, and Slansky 1979). 


E mi 13.62 
Mi= in M (13.62) 


with m ~ u>, M ~ u. The eigenvalues are m, ~m’/M and m, ~ M; 
(m,/m,) > (u/u,) « 1. Thus one ends up with one ‘superheavy neutrino’ 
and one extremely light particle which can be identified with the ordinary 
neutrino. 


Neutrino masses in SU(2) x U(1) models 


In the standard SU(2) x U(1) theory of electroweak interactions, neut- 
rinos are massless because the simple Higgs structure of the theory leads to a 
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global symmetry corresponding to lepton-number conservation which 
forbids the Majorana mass term Vi vı and there are no vz that could combine 
with vı to form a Dirac mass term. In other words the masslessness of the 
neutrinos is related to the restricted particle content being considered in the 
standard model. When we consider a more complete unification it is 
inevitable that the number of fields will increase, with the consequential 
appearance of neutrino mass terms. (We shall see in §14.5 that the simplest 
grand unification scheme, the minimal SU(5) model, still has a global 
symmetry corresponding to the conservation of baryon number minus the 
lepton number (B — L) which also forbids the Majorana mass terms.) 
However in GUT such as SO(10) B — L is broken and neutrinos naturally 
acquire masses. Although recent interest in massive neutrinos is tied to the 
exploration of grand unification we shall study the m, # 0 extensions of the 
minimal SU(2) x U(1) model directly (Cheng and Li 1980qa) as all grand 
unified models must contain SU(2) x U(1) as a subtheory anyway and it is 
much simpler to work with the electroweak theory. 

In the standard electroweak model, the lepton fields and Higgs scalar have 
the following SU(2) x U(1) transformation properties 


TY sc ka 
©, = WE (2, +1). (13.63) 


The first entries in parentheses on the right-hand sides of eqns (13.63) are the 
dimensions of SU(2) representations and the second entries are the U(1) 
hypercharge Y = 2(Q — T). Lepton flavour indices are suppressed. Terms 
bilinear in lepton fields are 


llr i (2, 1) x (1, =2) a (2, =I) 
kh, ~ (2, —1) x (2, — 1) = (1, —2) + (3, —2) 
Tig ~ (1, —2) x (1, —2) = (1, =4). (13.64) 


With ® ~ (2, 1) only the Yukawa couplings /p/,® + h.c. are present in the 
standard model and we have a global symmetry corresponding to lepton- 
number conservation. 


There are many possible extensions of the standard model to give m, 4 0; 
they can be broadly categorized as 


(1) Extension in the Higgs sector only; 
(2) Extension in the lepton sector only; 
(3) Extension in both Higgs and lepton sectors. 


In case (1), other scalars, besides the doublet ®, that can join the lepton 
bilinear in (13.64) to form SU(2) x U(1) gauge-invariant Yukawa couplings 
are triplet: H ~ (3,2), singly charged singlet: h* ~ (1,2), and doubly 
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charged singlet: R*+ ~ (1, 4). For example the triplet H with Y = 2, 


( Ht Y2 H+ ) 
cH = (13.65) 
J2H° -H* 


gives rise to the additional Yukawa coupling and trilinear scalar coupling 
fT lH (et); + uO; H* (et); + h.c. (13.66) 


where e= it}, with c; and (et); being, respectively, antisymmetric and 
symmetric. When H develops a vacuum expectation value 


0 0 

<t Hg = ( (13.67) 
Vy 0 

a Majorana mass term for the neutrino, (vyf) vivu, results. It should be noted 

that vy also contributes to the W and Z masses with 


v + 2vå 


= (M,,/M, cos 6y,)? = 
p = (My /M, w) v2 + 403 


(13.68) 
where v, is the vacuum expectation value of the doublet. The phenomeno- 
logical result (12.120), p = 0.998 + 0.050, restricts (vy/v,) < 0.17 if one 
standard deviation is allowed. 

In case (2) the simplest scheme is obviously the addition of a neutral 
singlet, the right-handed neutrino vg. We then have the additional 
Lagrangian terms (for simplicity we first consider a one-flavour theory) 


L' = flav? + Bigva + hic. > Dive + BYSvp the. (13.69) 


where D = (1/,/2)ug J. A Majorana bare-mass term B is present because vp is 
totally neutral with respect to the SU(2) x U(1) group and we do not impose 
lepton-number conservation on the theory. Thus in this extension we are 
naturally led to consider neutrino mass terms of the Dirac and Majorana 
types (eqn (13.43)). Since the Aviv, term is absent (i.e. we do not introduce 
the triplet Higgs particles) the mass matrix 1s in fact of the form (13.62). If we 
make the plausible assumption that the Dirac mass term D is of the order of 
charge-2/3 quark masses and the Majorana mass term B is of the order of 
the energy scale when the GUT is broken into SU(3) x SU(2) x U(1), hence 
has a very large value. Then the weak eigenstates vı and vp are super- 
positions of two Majorana mass eigenstates with vı being predominantly a 
neutrino with a light mass ~4D?/B and a tiny admixture (D/B) of a super- 
heavy neutrino with a mass œ B; the converse holds for vp. 

One may also entertain the possibility that D and B are of comparable 
magnitude and, for whatever reason, are both small. In this case the mixing 
angle will not be small and the small mass eigenvalues allow for neutrino 
oscillations. But this is a type of oscillation different from those considered 
earlier (13.21) as the diagonalization of (13.43) (eqn (13.45)) means 


K\ _ Coot nm’ Ni (13.70) 
w sinô cos@/\n, 
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and the chiral projections of y and œ (13.41) are just the usual weak 


eigenstate neutrino 
l X V VL 
- (1 — = 13.71 
2 ra (2) | ia be a 


Thus we have neutrino—antineutrino oscillations vı > (v°), i.e. vi > (Vp). 
Since the vps interact only through the superweak Yukawa couplings, such 
an oscillation with neutrinos turning into ‘sterile particles’ would be very 
different from the flavour oscillation discussed before (Barger et al. 1980; 
Cheng and Li 1980a). When the lepton-flavour family structure is taken into 
account, D and B become 3 x 3 matrices (for three lepton flavours). Each 
weak eigenstate neutrino will be a superposition of six Majorana mass 
eigenstates. One will then have both flavour-changing oscillations and 
particle—antiparticle oscillations: v,, <> vp, and v, e (vb). 

With the guiding principle that global symmetries should be determined by 
gauge invariance and renormalizability once the particle content is fixed, we 
find in this study of simple m, # 0 extension of the minimal SU(2) x U(1) 
model such that, if m, 4 0, the physical neutrinos invariably turn out to be 
Majorana particles. This reflects mainly the point that if both Dirac and 
Majorana mass terms are present the mass eigenstates are still the self- 
conjugate Majorana fields. One can conclude that Majorana fields are 
natural representations of neutrino particles. 


13.3 p — ey, an example of A,-gauge loop calculations 


Renormalizability is the principal feature of gauge theories of weak 
interaction. It will be instructive to go through the details of one non-trivial 
example of a higher-order weak radiative correction calculation. Here we 
study the neutrino-oscillation induced u — ey as an illustration of R.-gauge 
loop calculations. Thus we are working in the framework of the standard 
model (with one doublet of elementary Higgs scalars) modified by the 
presence of neutrino mass terms. We choose this example because, unlike the 
sd — sd box diagrams of §12.2, the would-be-Goldstone bosons are here not 
negligible in the leading order and must be included to obtain a gauge- 
invariant finite result. Also, the basic mechanism in Fig. 13.6 for u > ey 
involves the distinctive non-Abelian features of the theory: the existence of 
trilinear couplings of gauge bosons. Of course it is a GIM-suppressed 
amplitude (by the neutrino mass difference). However, if observable (this 
unfortunately is not likely), its interpretation will not be complicated by the 


Fic. 13.5. 
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strong interaction effects as in the case of strangeness-changing neutral- 
current processes. 
The u > ey amplitude Fig. 13.5 can be written 


Tu > ey) = e*Xel Ju) (13.72) 
where ¢“(q) is the photon polarization. T has the Lorentz decomposition 


cel JZ" lu) = üp — g)Lig’o,,(A + Bys) + y(C + Dys) + q,(E + Fys)]u,(p) 


(13.73) 
where A, B, ..., F are the invariant amplitudes. From electromagnetic gauge 
invariance 

us = 0, (13.74) 
we have the condition 
—m(C + Dys) +m,(C — Dys) + q°(E + Fys) = 0 
or 
C=D=0 (13.75) 


when the photon is on-shell q? = 0. And, since e*g, = 0, the on-shell u > ey 
amplitude is a magnetic transition 


T(u > ey) = ap — 9)[ig’o,,(A + Bys)Ju,(p). (13.76) 


As (13.76) corresponds to a dimension-five operator, the on-shell u > ey 
amplitude must be represented by a set of loop diagrams. They result in a 
finite amplitude since there can be no counterterm to absorb the infinities— 
the same reason why the (g — 2) anomalous magnetic moment of electron 
must be finite and calculable in QED. 

As it is a lepton flavour-changing process it is strictly forbidden in the 
standard theory with m, = 0 and muon number conservation. In this section 
we shall assume that neutrinos are not massless, and their mixings and 
oscillations mediate u — ey (see Fig. 13.6). Initially we choose to work with, 
neutrinos having the more familiar (pure) Dirac mass terms, 


Y= Uy -@ =e, pst = I, 2,3; (13.77) 


Extensions to the more general cases of Majorana neutrinos will be 
presented at the end of the section. The lowest-order diagrams contributing 
to the p — ey amplitude in the R,-gauge are displayed in Fig. 13.6. Since we 
know that the final amplitude must have the form of the magnetic transition 
(13.76), our strategy is to ignore all terms that cannot be reduced to the 
magnetic moment term. This means that there is no need to calculate the 
diagrams in Fig. 13.6(e) since they are all proportional to i,y*u, and will be 
cancelled by terms of similar form coming from the diagrams in Fig. 13.6(a)— 
(d). As we shall make the approximation m, = 0, the two u — ey invariant 
amplitudes are equal, 


A=B, (13.78) 
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Fic. 13.6. One-loop diagrams for the p > ey amplitude. The dashed lines represent the would- 
be-Goldstone bosons in renormalizable gauges. Thus, only (a) and the two graphs in (e) remain 
in the unitary gauge. 


corresponding to the final-state electron being left-handed (it is e; that 
couples to W and the helicity cannot be flipped in the m, = 0 limit). Also, 
using Gordon decomposition (see Appendix A), we have 


T = Aup — gq) + ys )io,,q°e’U,(p) 
= Au,(p — 4)(1 + ys (2p — myye)u,(p). (13.79) 


Thus in our calculation of the invariant amplitude A we need only to 
concentrate on the p:e term. The momentum assignments of diagrams (a) to 
(d) in Fig. 3.6 are shown in Fig. 13.7. 


q 
k k+q 
p ptk p-q 
Fic. 13.7. 


Diagram (a). 
_{ d*k | ig i i ig 
T;(a) = —i Min Ee = a(5 45) Uci — 1 Ea (555 
x Uy — padu (D) LIA (RK) ITAA + 9) (— ie) page” (13.80) 


where I[,,6’ is the W-boson photon vertex: For T,ag(ki, k2, k3) = 


[(A3 — k1)aGyp + (k2 — k3)yGap + (kı — k2)g9;a] with all the k;s flowing into 
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the vertex, we have 
E T pag = q, k + q, —k)= lg E = [(2k: E)Gug — (k T 24) p&q aa (k T TD a€p | 
(13.81) 
where A’¥(k) and A“*(k + q) are the W-propagators in the R,-gauge: 


Aww(k) = —Lopy — (1 — Ekk, /(k? — €M*)1/(k? — M?) (13.82) 
and M is the mass of W. U. and U, are the mixing angles of (13.77). 


When we sum over all the diagrams corresponding to the three intermediate 
mass eigenstate v,s 


UsU,, l m? 
PERRE OA UžU i + ——— +... 
3 fe +k) — mi z- 3 Í eae +h? [p+ OPP 
U _USU mz 
+... (13.84) 
= 2 [p+ kPy? 
the leading term vanishes, 2; UU; = 0, reflecting the GIM cancellation 
mechanism. We then have 


T(a) = ¥ Ta) = ic di a (13.85) 
OTO JOD Up + kT | 
where 
ge ; 
C= a $ URU, mM; (13.86) 
R = AKA" (k + QN Tag (13.87) 
with 
Nyy = UCP — 4), (P +K — ys)uy(p)- (13.88) 
The W-boson propagator can be split up as 
A®(k) = AP (k) + 4A5°(k) (13.89) 
with APE) = — (9 — k"k'/MP)(K? — M?) 
AL(k) = —(k#k’/M7)/(k*? — EM?). (13.90) 
Substituting (13.89) into (13.87) we note that 
AY (kA (k + QT ,g = 0 (13.91) 
because 
(k + q)*k’T,, = 0. (13.92) 


We have for (13.85) 
T(a) = ic |r 
(27)* [(k +p) ] 
Sı — S2 — S3 S2 
fc — M*)[(k +q}? — M°] . (k? — €M*)[(k + q) — €M?] 
S3 


(KF = MO + q- Tal = 


+ 
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where 
Si = IPN, 
Sa = (KTRK N y )/M? 
S3 = [(k + g)T4[(k + D'N, ]/M°. (13.94) 


Combining the denominators using Feynman parameters and shifting the 
integration variable, we get 


| dtk [S,-8,-S § 
T(a) = 13!c fa da, da, I Ee a T 


$ 
where 
a? = (1 — a,)M? +... 
b? = fd — a —a,)E + a, |M? +... 
d?’ = [(1 —a, — a2) +0,€]M7+.... (13.96) 


Picking out only the p:e terms, 
S, > §, =(p-e)[a.(1 + ys Ju 2m, [2 — «,)* + (2a, — 102] 
S, > S, = —k°(p Dah + ysu, ](mn,/M?) 
x {(3a%, — 1) + [2a} — a, + (2%; — 1/2)]} 
S; > S, = —k°(p ALa] + ysu, (m,/ M’ [20] + a, + (2%, — 1/2)a.). 


(13.97) 
After momentum integrations 
d*k l 4 
(2n)* (k? —a?)*  96r?at 
d*k k? —i 1 
loa (k? — a*)* 48r? a? a 


and the integration over Feynman parameters a, and «,, the contribution 
from diagram (a) to the invariant amplitude 4 is 


c m, 1 Iné l č ln é 
Ala) aa ao (li =e ') | (13.99) 


Diagram (b). 


is ee ee eee ee 
T(b) = —i | Ony? E q) (z 55) Uža — ys) Pare A 


a U. 
x (=) > [m(1 — ys) —m,(1 + js | 


x [-iA*"(k + q)] — (ieMe,). (13.100) 


a 
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Splitting the W-propagator as in (13.89) and summing over all v,-diagrams 
using the approximation (13.84) and also 


Uš Unm? US Um? 
L oan oe? R 
we have 
_ ( d*k N, l 
ae | (ny* [(p + O 2 — EM? 
[e* —e-k(k + q)*/M7]_  e'k(k + g)*/M? 
k+- M? tkt -Mf 12102) 


where c is the coupling constant of (13.86) and 


N, = u(p ~ q)(l T Ys yalp + k)? = m,, (kK i m,) ]u,(p). (13.103) 


Combining denominators using Feynman parameters and shifting the inte- 
gration variable, we have 


T(b) = 6ic fo da, da, lane | ope + wom (13.104) 
where a and b are given in (13.96). Again picking out p:e terms in the 
numerator, 
N, = Nè > Ñ, = -Ap OA + y5)u, Jom, (13.105) 
N, =N,(k + q)*k-¢/M? > Ñ, = k?(p-e)[a.(1 + ys)u,] 
x 9m, [(1 — 4a,)(1 — a, — a2) — (1 — 3a,)]. 


After the momentum integrations of (13.98) and Feynman parametric 
integrations we have the invariant amplitude from diagram (b) 


c m|5 41ngé 7/ 1 Ciné 
A(b) ~ pate ae t3 eS AKG ae ')} (13.106) 


Diagram (c). Following steps which are entirely similar to those followed 
in the calculation of diagram (b) we obtain 


c mf5 inë 1/1 \/éElné 
A(c) al- eiti -S 1) | (13.107) 


Diagram (d). 


dtk i Už 
T(d) = —i |e fa — q) (4) 7 [m(1 + ys) — m1 — ys)] 


x ree ea Gr oF [m1 == Ys) = mi aJ DO 


i i ; 
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Making the aca as m, = 0 and using (13.101), we have 


dék 
T(d) = o j7 [u.(p — q) + ys)ku,(p)] 
2k'e l l 


* Op + ky? K — EM? (k + q7 — EM? (13.109) 


Combining denominators, we perform the momentum integration 


E —i m 
T(d) = —4ic(p-e)(u.1 + ysu] [a da, % (a, + a) san) 


(13.110) 


where a’? = (1 — a, )€M?. After integrating over the parameters we have the 
invariant amplitude 


m, \ 5 
A(d) = _ |. 13.111 
) = S a (2e) 6é l ) 
The contributions of diagrams (a) to (d) are summarized in Fig. 13.8. 
R: gauge *t Hooft gauge (¢ = 1) unitary gauge (č —> oo) 
1 5 
Ry 1 — -f (&) + 29(¢) r l 
3 3 
5 4 7 
„A — + -f -- 98) l 0 
/ 6€ 3 3 
5 1 
So ZSO t-I 0 0 
6č 3 


PB ae ae o 


7 
iN 3¢ 3 


Fic. 13.8. Diagram contributions to the u > ey invariant amplitude Æ = B in units of 


(eg?/8M°)(m,/327°) 6, with f (&) = In E/E — 1), g) = LES (§) — LIE — 1). 


Clearly we have for the total contribution of these diagrams, 


2 
g My 


A=B= 13.112 
© 8M? 32m? cee 

where ô, is the GIM suppression factor 
ô, = > USU,(m?/M7). (13.113) 


We note that in the final result the dependence on the gauge parameter č has 
been cancelled out. A straightforward calculation of the decay rate yields 


3 
m 
I (u > ey) = an (141? + |BI*) (13.114) 


and using I'(u > evv) = m?G;/192n° this can be converted into the branching 
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ratio (Cheng and Li 1977; Petcov 1977; Marciano and Sandra 1977; Shrock 
and Lee 1977) 


Buu > ey) = I (u > ey)/T (u > evy) (13.115) 


where « is the fine structure constant and we have used G,/./2 = g?/8M?. 
Even if one takes a neutrino mass that saturates the cosmological bound 
100 eV (eqn (13.60)), we still have a hopelessly small B(u > ey) < 107*°. 

We now comment very briefly on the situation when neutrinos are 
Majorana particles (Cheng and Li 1980b). Again we can distinguish two 
broad categories. 


(1) The neutrino mass terms are pure Majorana AvVjv,. Each weak 
eigenstate is still a superposition of three (Majorana) mass eigenstates and 
the above ey calculation should be carried through without 
modification; 

(2) The neutrino mass terms have both Dirac and Majorana types. In 
particular when we have the situation of (13.69), DV, vp + BVRVp + h.c. with 
B >> D. (Theoretically this is the most attractive possibility.) Each weak 
eigenstate is a superposition of six Majorana mass eigenstates: three are light 
with m ~ D?/B; three are very heavy with M ~ B(>My). This u > ey 
calculation without the approximation of small intermediate neutrino masses 
has been carried out. As one would expect, the GIM cancellation is no longer 
complete because of the presence of superheavy neutrinos. However B(u 
— ey) is still unobservably tiny as the admixture of heavy neutrinos in the vis 
is extremely small being 0 ~ D/B or (m/M)?. One concludes that only in the 
case with the most general neutrino mass terms eqn (13.43) (i.e. having both 
Higgs triplet and vgs) and with their magnitude coming out as B > D> A is 
there any possibility of getting a neutrino-oscillation induced B(p — ey) 
which is large enough, hopefully, to be detectable. 


14 Grand unification 


We have some confidence that elementary particle interactions down to 
distances as small as 10~'© cm are correctly described by SU(3) x SU(2) x 
U(1) gauge theory. This is the standard model of particle physics: quantum 
chromodynamics is the strong-interaction theory and the Glashow- 
Weinberg—Salam model provides the theory of weak and electromagnetic 
interactions. 

Clearly it is desirable to have a more unified theory which can combine all 
these three interactions as components of a single force; a theory with only 
one gauge coupling. Georgi and Glashow (1974) showed that for the 
standard model, with the presently known quarks and leptons in each family, 
the simplest unification gauge group of colour and flavour is SU(5). How- 
ever, because of the large differences in coupling strengths of strong and 
electroweak interactions, this unification would not become apparent until 
the energy scale of 10'* GeV was reached, corresponding to a distance scale 
of 10778 cm. 

Unification of colour and flavour was discussed earlier by Pati and Salam 
(1973). There are also other attractive models based on the gauge groups 
SO(10) (Georgi 1974; Fritzsch and Minkowski 1975) and E(6) (Gursey, 
Ramond, and Sikivie 1975). In this chapter we shall concentrate on the 
simplest grand unified model based on SU(5). 


14.1 Introduction to the SU(5) model 


We shall first give the basic structure of the SU(5) model. (For a detailed 
discussion see Buras, Ellis, Gaillard, and Nanopoulos 1978.) The original 
motivation of Georgi and Glashow for using the gauge group SU(5) will be 
presented at the end of this section. 

A general representation under an SU(5) transformation may be expressed 
in tensor notation (see §4.3). 


Wi > ULULURU,.. wan (14.1) 
where all indices run from 1 to 5 and 
[UJ], = Lexp(ia*A*/2) J), 


is a 5 by 5 unitary matrix. {4°}, a =0,1,..., 23, is a set of twenty-four 5 x 5 
generalized Gell-Mann matrices, which are hermitian and traceless (so that 
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the Us are unitary and have unit determinants), with normalization tr(A7”) 
= 26”. For example, 


ro 2 
3 = 0 Pon 2 (14.2) 


—1 —3 


To obtain the SU(3) x SU(2) content of a representation we identify the first 
three of the SU(5) indices as the colour indices and the remaining two as 
SU(2), indices, 


i=(o,r) with a=1,2,3 and r=4,5. (14.3) 


Fermion content 


In the standard SU(3) x SU(2) x U(1) model there are 15 left-handed (LH) 
two-component fermion fields in each family (generation). As we shall see, 
grand unification theories in general and the SU(5) model in particular do not 
shed light on the fermion replication problem (see §11.3). Thus for the sake of 
brevity, we shall initially write down the theory for the first (e) fermion family 
only 


(ve € J: (A, 2) 
er: (1, 1) 
(u,, da): (3, 2) 
ur: (3*, 1) 
dî“: (3*, 1) (14.4) 


where the SU(3) x SU(2) transformation properties of the family are 
displayed on the right-hand side of eqns (14.4). The superscript c stands for 
charge conjugate field (see eqns (13.34) and (13.35)) 


Wo = CPY* = iy?y* 
(Ur) = (V°) = Yi. (14.5) 
The SU(3) x SU(2) contents of the simplest SU(5) representations are 
The fundamental rep y; 5 = (3,1) + (1, 2) 
The fundamental conjugate rep yê 5* = (3*, 1) + (1, 2*) 
The antisymmetric 5 x Sy;; = — Y; 10 = (3*, 1) + (3,2) + (1, 1). 
(14.6) 


The less obvious points in the above decomposition are e*”"',, ~ (3*, 1) and 
Es" ~ (1,1), where we have followed the index-labelling convention of 
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(14.3). A comparison of (14.4) and (14.6) shows that one family of fermions 
can be accommodated snugly in an SU(5) reducible representation of 5* + 10 


5*- (YÒL 2 (d‘'d"d° Bo 2 VaL (14.7) 
or 
5: (Wir = (d; d, dz e*t — vi) (14.8) 
and 
0 u? u? u dy 
— uy” 0 uc! u, d, 
1 
10: (ijt = wD u? BRI 0 U3 d, (14.9) 
-u -u -u 0 e 


d, —d, —d, —et 0 L 


It should be noted that in this one-family approximation gauge 
eigenstates are identical to mass eigenstates. We have thus labelled all the 
above gauge eigenstates by fields with definite masses. When the p and t 
fermion families are introduced (§14.5) they will also be assigned to 5* and 
10 representations. All the gauge eigenstates will then be some linear 
superpositions of mass eigenstates (see §11.3). 

We have chosen the phase convention of having the neutrino field appear 
in 5* (and 5) with a minus sign. This conforms to our previous choice of 
I? = (v,e) as a 2 under SU(2) and as being related to its conjugate l, = 
(e, —v), through the antisymmetric tensor l? = e”],. In the above we have 
only considered the SU(3) x SU(2) assignment. The correctness of the 
particle’s U(1) charges will be shown below when we discuss ‘charge 
quantization’. | 


Charge quantization 


One immediate consequence of the SU(5) scheme is a very simple explanation 
for the experimentally observed charge quantization. In fact, whenever the 
unification gauge group is simple, the charge quantization will follow. This is 
because the eigenvalues of the generators of a simple non-Abelian group are 
discrete while those corresponding to the Abelian U(1) group are continuous. 
For example, in the SO(3) group of rotational symmetry the eigenvalues of 
the third component of the angular momentum can take only integer or half- 
integer values, while in the U(1) symmetry of translational invariance in time 
there is no restriction on the (energy) eigenvalues of the corresponding 
generator. Thus in SU(5) theory, since the electric charge Q is one of the 
generators, its eigenvalues are discrete and hence quantized. 

Since the electric charge is an additive quantum number, Q must be some 
linear combination of the diagonal generators in SU(5). There are only four 
of these in SU(5) and, since Q commutes with all the SU(3),.1.4, elements, we 
have 


Y 
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where T; and T, are the diagonal generators belonging to the SU(2) and U(1) 
subgroups, respectively; they are 4 of eqn (14.2) for the fundamental repre- 
sentation. We see from eqn (14.10) that, for the SU(5) group, the 
formula for Q is not enlarged by terms beyond those already present in the 
Glashow—Weinberg-Salam model. The coefficient c which relates the 
operators Y and Tọ can be obtained by comparing in the fundamental 
representation the values of Tọ as given in (14.2) and the hypercharge values 
of the particles in (14.8), ie. Y(5) = (—4, —4, —4, 1, 1), 


c= —(5/3)/, (14.11) 


The presence of the coefficient c signifies that the hypercharge Y is not 
properly normalized to be one of the SU(5) generators which have their scale 
fixed by the non-linear commutation relations 


At AP A 
SRE |e iC: — 
É J 2 


tr(Ata) = 25% a,b,c =0,1,..., 23. 


with 


To check (14.11) we note that for the fundamental representation T, = 44/2, 
eqns (14.10), (14.11), (14.2), and (14.8) yield 


—1/3 
—1/3 
Q(W;) = —1/3 = Q; ð;j. (14.12) 
l 
0 
It follows that the fundamental conjugate representation 5* has 

O(W') = — Q; 6;; (14.13) 
From the transformation property shown in (14.1), a general tensor wi: 

has the same quantum number as y'W/w,.... Thus, 
QM) = Q; + Q; (14.14) 
OW!) = Q; — Q;. (14.15) 


These quantities are, in a self-evident notation, the diagonal elements of Q 
for the 10 and 24 multiplets (also see eqn (4.141)). 

The most interesting aspect of charge quantization as shown in (14.12) is 
the relation between colour SU(3) and charge. The traceless condition for the 
charge operator requires, for three colours, 


30, + QO. =0 (14.16) 


Quarks carry 1/3 of the lepton charge because they have three colours. 
Thus SU(5) theory provides a rational basis for understanding particle 
charges and the weak hypercharge assignment in the standard electroweak 
model is understood. 
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Anomaly cancellation 


We should also check that the LH fermion assignment is free of anomaly. Of 
course, we already know from eqn (11.52) that each family is anomaly-free 
with respect to the SU(3), x SU(2), x U(1) gauge bosons. Now we need to 
make certain that the fermionic couplings to all the remaining SU(5) gauge 
bosons do not introduce anomaly either. 

In general the anomaly of any fermion representation R is proportional to 
the trace (see eqn (6.60); Georgi and Glashow 1972b; Banks and Georgi 1976; 
Okubo 1977) 


tr({T"(R), T*(R)}T*(R)) = $4(R)d™ (14.17) 
where T°(R) is the representation matrix for R and the d®°, are the totally 
symmetric constants appearing in the anticommutators (5.238) 

{A%, APY = 2AM AS. (14.18) 


We note that A(R) in (14.17) which characterizes the anomaly of a given R is 
independent of the generators and is normalized to one for the fundamental 
representation. Thus we can use some simple generator to calculate A(R) and 
to show that the anomalies cancel between the 5* and 10 representations. 
Take, for example, T" = T? = T° = Q; we immediately find 


A(5*) tr Q°(W") 

A(10) tr Q? (Yis) 
E =D + 0° E 
~ 3(—2/3)3 + 3(2/3)3 + 3(-1/3)3 + 13 


and 
A(5*) + A(10) = 0. (14.19) 


Thus the combination 5* and 10 of the fermion representation is anomaly- 
free. 


Gauge bosons 


The SU(5) adjoint representation A‘ has dimension 5? — 1 = 24 and an 
SU(3) x SU(2) decomposition 


24 = (8, 1) + (1, 3) + (1, 1) + (3, 2) + (3*, 2). (14.20) 
Using the index-labelling convention of (14.3), we interpret this as 
gs (8, 1) are the SU(3), gluons G4 of eqn (10.63) 


A’; (1,3) are the three SU(2) vector fields W with W+ = (W! + W?)/ V2 
of eqns (11.38) and (11.76) with the notational change of A to 
W. 


— J4? + VA; (1,1) is the U(1) B-field of eqn (11.39) corres- 
ponding to the diagonal element of Aj which does not belong to 
either SU(3) or SU(2). 
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The remaining 12 gauge fields have both SU(3) and SU(2) indices 
Ar: (3,2) Az: (3*,2). 
They are denoted as X, Y gauge bosons 
EKo A= (12) 
These vector particles, having non-zero triality with respect to the colour 
SU(3) group, carry fractional charges. According to eqn (14.15) we have 


Qx = —4/3 and Qy = —1/3. (14.21) 


If we put all the SU(5) gauge bosons in a 5 x 5 matrix form, A = 
2o A: 


Xy Y; 
[G — 2B//30]j X, Y; 
A= 5 X3 Y3 
X Xox W?/./2 + 3B/ V30 w+ 
ys ye y? W- —W?/ V2 + 3B/./30 


(14.22) 


Spontaneous symmetry breaking is supposed to take place in two stages, 
characterized by two mass scales as given by the vacuum expectation values 
of two multiplets of Higgs fields v, > v, 


SU(5) 5 SU(3) x SU(2) x U(1) 3 SU(3) x U(1). 


This corresponds to the X, Y masses being superheavy Myx, y > My.z. The 
problem of spontaneous symmetry breaking in the SU(5) model will be 
studied in the next section. 


Motivation for the SU(5) group 


In constructing a grand unified theory of strong, weak, and electromagnetic 
interactions with one coupling, we seek a gauge group which is simple, or at 
most a product of identical simple groups (with the same coupling constants 
by some discrete symmetries). It should be large enough to contain the 
SU(3) x SU(2) x U(1) group of the standard model as a subgroup; thus it 
must be at least of rank 4. By this we mean that it must have at least four 
generators that can be simultaneously diagonalized, since it must contain 
the standard model which already has four mutually commuting genera- 
tors: two from the colour SU(3) and two, the weak T, and the hypercharge 
Y, from SU(2) x U(1). 

In fact one can make an exhaustive listing of Lie groups with a given 
rank (l) 


A,=SU( +1), B,=O2/+1, C,=Sp2),  D,= 02) 
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as well as (the exceptional groups) E, 73, F4, and G,, the subscript 
indicating the rank. Thus the candidate / = 4 groups are SU(5), O(9), Sp(8), 
O(8), F4, SU(3) x SU(3), and SU(2) x SU(2) x SU(2) x SU(2), etc. 

However, all these possibilities, except SU(5) and SU(3) x SU(3), can be 
eliminated since they do not have complex representations. It is obvious from 
eqn (14.4) that we must have complex representations for fermions; in the 
standard model the fermion representations are not equivalent to their 
complex conjugates. The remaining possibility, SU(3) x SU(3), can quickly 
be eliminated since it cannot accommodate both the integrally and 
fractionally charged particles. 

Thus SU(5), being rank-4, is the smallest group that can contain 
SU(3) x SU(2) x U(1) without introducing any new fermions. It has 
complex representations and can accommodate fractional charges. As we 
have seen, its anomaly-free reducible representation 5* + 10 has just the 
right quantum numbers to fit one generation of leptons and quarks. Groups 
larger than SU(5) would necessarily involve particles other than the 15 two- 
component fermions with their familiar quantum numbers. In this sense 
SU(5) is the unique theory for the simplest grand unification scheme. 


14.2 Spontaneous symmetry breaking and gauge hierarchy 


The SU(5) model must have two mass scales: the X and W gauge boson 
masses. (For the remaining part of this chapter we shall often refer to the X 
and Y bosons collectively as X, and to the W and Z bosons collectively as W.) 
They characterize the spontaneous symmetry breakings (SSB) of SU(5) to 
SU(3) x SU(2) x U(1) and to SU(3) x U(1). Furthermore, as we shall see in 
the subsequent sections, we must have My larger than My by something like 
12 orders of magnitude. Thus, there is a vast hierarchy of gauge symmetries. 
As we discussed in Chapter 8, the Higgs phenomenon can provide masses 
for gauge bosons with elementary scalar fields developing a vacuum 
expectation value (VEV). This mechanism preserves the renormalizability of 
the theory. In order to have in one theory two mass scales, we must have two 
sets of scalars, and they develop vastly different VEVs which give rise to the 
desired gauge hierarchy. For the SU(5) model, this can be achieved with 
scalars in adjoint (H}) and vector (@;) representations 
SU(5) Ë SU(3) x SU2) x U(1) Ë SUB) x UC). (14.23) 
The general SU(5)-invariant fourth-order potential is 


V(H, $) = V(H) + Vib) + Ag(tr H7)(b") + A5(G'H7@). (14.24) 

with 
V(H) = —mi(tr H?) + A, (tr H*)* + A,(tr H*) (14.25) 
V(b) = —m3(¢'¢) + 13(¢'d). (14.26) 


H is a traceless hermitian matrix. For simplicity we have imposed extra 
discrete symmetries: H — —H and ¢ > —¢@ to get rid of various cubic 
terms. 


14.2 Spontaneous symmetry breaking and gauge hierarchy 435 


We first seek values of H 4 0 that minimize the potential with @ = 0. (This 
corresponds to the first stage of SSB.) Afterwards we look for deeper minima 
at small, but non-vanishing values of o (the second stage of SSB). 

It can be shown that, for 4, >0 and A, > —7/30 4,, V(H) has an 
extremum at H = <H> with 


(Hy =v, 2 (14.27) 


—3 
where 
vi = mî /(60A, + 144,3). (14.28) 


We will only outline here the derivation given by Li (1974). First H can be 
diagonalized by a unitary transformation so that Hi > H, ô}, with 2, H, = 0. 
The equations 0V/0H, = 0 are then cubic equations in the diagonal elements 
H,, which can assume at most three different values. Detailed calculation 
then shows that, for the range of couplings indicated above, the potential V 
takes on its minimum when there are only two different values of H;, which 
can be grouped as in eqn (14.27). 

We shift the field to define a set of new scalars, which, in a matrix notation 
similar to that of (14.22), can be expressed 


| Ay, Ay, 
[H] ~ 2/30 Hy, Hy 
en eae | Hy, Hy; (14.29) 


Hy, Hy Hy; 

HY, Hy, Hy; 
Their mass spectrum can be obtained by a straightforward evaluation of the 
second derivative of V at H = <H>. This is shown in Table 14.1. 


TABLE 14.1 
Scalar fields H: 24 and @:5 in the minimal SU(5) model. 
The massless (Hx, Hy,) are would-be-Goldstone bosons 
which are eaten by (X*, Y*) 


| LH3 Js + 3Ho/ v30 


SU(3) x SU(2) 


Scalar fields quantum numbers [Mass]? 

[Hs]; (8, 1) 204-07 

[13]; (1, 3) 801,07 

Ho (1, 1) 4m? 

(Axa, Hya) (3, 2) 0 

(Has Hy.) (3*, 2) 0 

Pra (3, 1) —m3 + (304, + 445)vi 


Par (1, 2) -m3 T (3044 + 945)vi 
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Since H is in adjoint representation, the covariant derivative may be 
written 
D,H = 0,H + iglA,, H] 


= D,H’ + iglA,, <H)] (14.30) 


where A” is the matrix gauge field of eqn (14.22). Thus the original ‘kinetic 
energy’ term |D,H|? contains a factor of g?|[A,, <H>]|’. This is the mass 
term for gauge bosons. 

From eqn (14.27) it is clear that (Hy commutes with the generators of the 
subgroup SU(3) x SU(2) x U(1). Thus the mass terms for the Gj, W,, B 
fields vanish. The X and Y gauge bosons acquire masses by combining with 
the would-be-Goldstone scalars H, and Hy. We have 


My = My = </(25/2)90. (14.31) 


The fact H develops VEV also affects the @ system through the cross- 
couplings A, and A,. The colour triplet ¢,: (3,1) and the flavour doublet 
pa: (1, 2) components of @ = (Pu, Qar) acquire respective mass terms 


m2 = —m? + (30A, + 915)v?. (14.32) 


Thus after the first stage of SSB all non-zero values of particle masses are 
expected to be of the order v,, namely My, which should be superheavy. 
For the second stage of SSB, SU(3) x SU(2) x U(1) > SU(3) x U(1) at 
250 GeV, we need an SU(2) doublet scalars. We assume that ‘somehow’ 
m2 of this doublet ¢, is vanishingly small (compared to v?). Thus @, will 
survive to low energies (~ 250 GeV) as the superheavy particles (with masses 
of the order v,) decouple. The relevant physics of the light particles is 
described by an SU(3) x SU(2) x U(1) invariant effective potential 


Valha) = —midbidba + loiha). (14.33) 


This is of course nothing but the Higgs potential for the Weinberg—Salam 
model (see Chapter 11). Gauge symmetry hierarchy means that @, develops a 
VEV much smaller than v4, 


ERN, 
(6s) = (> ] 
va = (m2/A3)t ~ 250 GeV. (14.34) 


We should emphasize that the above discussion has been carried out using 
the approximation of treating the two stages of SSB separately. A proper 
minimization of the combined system V(H, @) would yield a VEV <H that is 
slightly shifted from eqn (14.27). This would also break some of the mass 
degeneracies shown in Table 14.1 but all these corrections should be small, 
on the order of (v,/v,). Also the X and Y gauge bosons would have an O(v;) 
mass difference. 

We shall see that the model requires v; 2 10’? v3. It is challenging to 
understand the presence in a theory of such very different mass scales. A 
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small mg, as required by a small v, through eqn (14.34), appears to be 
unnatural since it receives a contribution from the large v, in eqn (14.32). 
Furthermore, there is the problem of a consistent implementation of such a 
large gauge hierarchy in the presence of radiative corrections (Gildener 
1976), i.e., can we restrict light scalar particles to small masses to all orders 
in perturbation? The self-mass of an elementary scalar field is quadratically 
divergent; v, may be looked upon as the cut-off parameter for the low- 
energy effective theory of ġa. One would expect large mass corrections from 
diagrams such as Fig. 14.1 with Am? oc głv?. With such corrections it 
appears that a small scalar mass can be obtained only by fine tuning of para- 
meters at each order of perturbation. This is usually referred to as the gauge- 
hierarchy problem. For a review see, for example, Cheng and Li (1980c). The 
issue is whether one can find a model that avoids this unnatural feature. 
Such a model, if it exists, is likely to have additional symmetries. It is 
often suggested that they may be some form of supersymmetry which for- 
bids quadratically divergent scalar masses. (Here the supersymmetric fermion 
partners to the gauge bosons and Higgs scalars must be included. The 
leading divergence in diagrams such as Fig. 14.1 is then cancelled by the cor- 
responding fermion loops.) No realistic model has yet been constructed and 
the problem of a satisfactory implementation of gauge hierarchy is still an 
open one. 


<H> <H» 


$ $ 
Fic. 14.1. A self-energy diagram for the ¢ scalar particle. 


14.3 Coupling constant unification 


The standard model describes the strong, weak, and electromagnetic 
interactions in the energy range <10* GeV with the three different coupling 
constants: g,, g, and g’ for the gauge groups SU(3), SU(2), and U(1), 
respectively. Thus there is no real explanation of the different strengths 
displayed by the three interactions. One of the great virtues of GUT (in fact 
its original motivation) is that it provides such an understanding (Georgi, 
Quinn, and Weinberg 1974). 

The grand unified theory, by definition, has only one coupling constant 
associated with the unified gauge group. This same coupling constant should 
apply to the subgroups as well. The possibility of different couplings for the 
various subgroups at low energies arises because of spontaneous symmetry 
breaking; the X, Y gauge bosons (or X bosons for short) of SU(5) acquire 
masses and decouple from coupling-constant renormalizations. This decoup- 
ling clearly will have different effects on the radiative corrections of different 
subgroup couplings, giving rise to different effective couplings at low energies 
through the energy dependence determined by the renormalization group 
equation illustrated in Chapter 3. The decoupling of heavy X bosons is 
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reflected in the unequal renormalization group coefficients for subgroup 
couplings. Below the unification scale [My in the case of SU(5)] they evolve 
differently, giving rise to the observed disparities of interaction strengths. 

Before proceeding with detailed analysis we shall first make some 
qualitative statements about the coupling constant unification, which is 
represented graphically in Fig. 14.2. 


Mx 4 


Fic. 14.2. Coupling constant unification. For the scale u > My, where the X boson mass can be 
neglected, the couplings corresponding to different subgroups remain unified. For the scale 
u < Mx, the couplings evolve in a pattern determined by the size of the gauge group. 


(A) Since the energy dependence of coupling constants is only logarithmic 
and, since, in the energy region ~ 10? GeV, g,, g, and g’ are quite different, 
the unification scale My is expected to be many orders of magnitude larger 
than 10? GeV. 

(B) From the analysis of the renormalization group equations, we have 
learned the following facts. For the non-Abelian gauge groups the coupling 
constant decreases with increasing energy and the rate of decrease is greater 
for larger groups. For the Abelian group the coupling constant increases with 
energy. Thus at energies less than Myx the ordering of coupling constants has 
to be g,>g>g'. This pattern is in fact compatible with experimental 
observation. 

(C) Furthermore, below M, the three coupling trajectories in Fig. 14.2 
must have just the correct relative strengths in order to all intersect at one 
point when reaching Mx. This implies a non-trivial consistency condition 
among g,, g, and g’. This relation predicts the Weinberg angle Ow, which 
relates g and g’, in terms of the fine structure constant « and the QCD strong 
interaction coupling constant a, (see eqn (14.57) below). 


We now proceed with the detailed analysis. First we must study the 
relation between the coupling constants of SU(3) x SU(2) x U(1) and SU(5) 
at the unification scale Mx. Consider the covariant derivatives for these two 
groups; for definiteness we shall display those operating on the fundamental 
representations of the groups 


8 3 
D, =6,+i9, Y G2/2+ig Y Wir"/2 + ig'B,Y/2 (14.35) 


a=1 r=1 


14.3 Coupling constant unification 439 


and 
23 


D, = ô, +igs Y AtA2/2. (14.36) 
a=0 


The definition of coupling constants depends on the normalization of the 
generators. For the non-Abelian groups these normalizations are fixed by the 
nonlinear commutation relations of Lie algebra. Thus the Gell-Mann 
matrices {A*}, along with their SU(5) generalized version {4°}, and the Pauli 
matrices {t"} are all similarly normalized. tr{A*A"} = 26”, etc. And we have 


95=93=92=91 (14.37) 
with 

93 = Js, 92 Z9. (14.38) 
The coupling g, is that of the Abelian U(1) subgroup. Thus 

ig,A° Ay = ig’ YB, 

A’ is identified with the B, gauge field. One notes that the U(1) algebra 
does not provide any (nonlinear) restriction on its generator, and Y and 
A° may be normalized differently. We can determine this difference in nor- 


malizations by noting that for the particle assignment of 5 in (14.7) we must 
have weak hypercharge 


— 2/3 
— 2/3 
Y= — 2/3 (14.39) 
l 
l 
Comparing (14.39) to the 2° of (14.2), we have, as in (14.11), 
= —(5/3)72°, g' = —(3/5)*9, (14.40) 


since g’ Y = g,A°. Eqns (11.80), (14.37), and (14.40) can be translated into the 
statement that 
sin? Oy = g° (g? +g'7) = 3/8. (14.41) 


This and the equalities in (14.37) are valid in the SU(5) limit; i.e. for the 
energy scale u > My. Now we need to study the regime u < Mx. The 
evolution of the SU(n) gauge coupling constant is controlled by the 
renormalization group equation (see eqns (10.77)-(10.79)) 


dg, 


3 
T — b,g; (14.42) 
where 
b, = (11n — 2N;)/487? forn > 2 (14.43) 
and 
b, = —N,/24n?. (14.44) 
Thus, 


b, — b, = 11n/48r?. (14.45) 
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We have ignored the contribution coming from the Higgs scalar. N; 
is the number of quark flavours (Np = 6 for a three-family theory) and 
thus the fermion effects on the relative rate of coupling constant evolution 
cancel out. Considering that only quarks couple to SU(3) gluons and that 
both quarks and leptons couple to SU(2) and U(1) gauge bosons, one may 
find it surprising that the fermion contributions should be equal in all gauge 
coupling renormalizations. We will now digress to explain this situation. 


Fic. 14.3. Coupling-constant renormalization due to the fermion loop. 


The fermions contribute through the loop diagram in Fig. 14.3 which is 
proportional to F, = N.gz tr(T;T,), where Nm is the number of multiplets of 
(two-component) fermions coupled to the gauge bosons and all represen- 
tation matrices are similarly normalized with tr(7;T7;) = 36 ;;. We shall first 
explicitly work out the F, factor for each gauge group. For SU(3), N,, is the 
number of colour triplets. Since both quark and antiquark couple to gluons, 
we have Nm = 2Np and F; = Npg36,,. For SU(2), Nm is the number of 
doublets. Since for each lepton doublet there are three coloured quark 
doublets, we have N,,=3(1+3)N; and F,=N,g36,,. For U(1), a 
straightforward summation of (squared) weak hypercharges with proper 
rescaling of the normalization according to (14.40) also yields F, = Neg?. 
From the SU(5) viewpoint, F = F, = F, will no longer appear as a 
coincidental fact: all fermions form a complete (reducible) representation of 
the group, (5* + 10), and all representation matrices for each subgroup 
factor are equally normalized; in addition to this, is the requirement that 
all members of this representation obtain comparable masses « M,. This is 
to be contrasted with the situation for the gauge bosons, which end up with 
two very different mass scales: My « Mx even though they form a com- 
plete SU(5) representation (the adjoint rep). And this split in the masses pro- 
duces the different renormalization effects to the subgroup coupling 
constants. 

We now return to the solutions of eqn (14.42). For g4, g2, and g3, we have 


91 (u) = 91 (uo) + 2b, In(u/no) (14.46) 
9 (HL) = 92 *(Uo) + 2b, In(u/no) (14.47) 
93 (HL) = 93 (Uo) + 2b; In(u/Uo). (14.48) 


We can express the low-energy couplings in terms of more familiar 
parameters by using (11.80) tan 0w = g'/g, (11.93) e = g sin 0w, and (14.40) 


gi(u)/4r = (5/3)a(u)/ cos? Ow (14.49) 
g3(p)/47 = a(u)/sin? Oy (14.50) 
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and 
g3(w)/4n = a (y). (14.51) 
We can recast eqns (14.46)}-(14.48) in the form 
a, (u) = a5 + 8mb, In(u/M,x) (14.52) 
a 4(u) sin? Ow = a5 *.+ 87b, In(u/My) (14.53) 
(3/5) 1(u) cos? Ow = a5 t + 82b, In(u/Mx) (14.54) 


where we have used eqn (14.37). 
gi(Mx) = g2(Mx) = 93(Mx) = 95 and g§/4n = as. (14.55) 


Taking the linear combination [2 x eqn (14.52) — 3 x eqn (14.53) + eqn 
(14.54)], we have 


2a t — 3a t sin? Oy + (3/5)a~ ! cos? Ow 


= 8n[2(b} — b,) — 3(b, — b, )] In(u/Mx) = 0. (14.56) 
The right-hand side vanishes because of eqns (14.43}(14.45). Thus 
sin? Oy = 1/6 + 5a(u)/9a (u). (14.57) 


This is the consistency condition mentioned in Remark (C). The values of the 
coupling constants at y = My where the Weinberg angle is deduced from 
neutral-current experiments are compatible with this prediction. 

Taking the linear combination [(8/3) x eqn (14.52) — eqn (14.53) — 
(5/3) x eqn (14.54)] we have from eqns (14.43}(14.45) 


1 8 

In(M,/p) = ann — | (14.58) 
alu) 3a(u) 

This determines the unification scale Mx. Also, combining eqns (14.57) and 

(14.58), 


sin? Oy = 3/8 — (55/24n)a(u) In(Mx/u). (14.59) 


We should note that in calculating b, we have taken the simplest possible 
threshold behaviour: for an intermediate particle with mass m < u, the mass 
is taken to be zero; for m > u the mass is taken to be infinite and the particle 
decouples. In particular for u > Mx, the SU(5) X, Y gauge bosons contribute 
and the coupling-constant equality of eqn (14.37) is maintained to all orders 
of perturbation theory. For u < My they decouple and the b,s become 
different for subgroups SU(3), SU(2), and U(1). With more careful treatment 
of thresholds we can actually identify Mx with the mass of the X boson. In 
eqns (14.43}(14.45) we have also ignored the scalar contribution. By 
including the Higgs scalar of the Weinberg-Salam model and with more 
careful treatment of higher-order effects, one obtains a set of numerical 
results in the neighbourhood of 


Mx ~ 4 x 10!4 GeV (14.60) 
sin? Ow ~ 0.21 (14.61) 
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for a range of inputs of the QCD scale parameter A ~ 300 MeV. (For reviews 
see Langacker 1981; Marciano and Sirlin 1981.) 


14.4 Proton decay and baryon asymmetry in the universe 


Proton decay 


A prominent feature of GUTs is the nonconservation of baryon number. The 
SU(5) model has this property and the leading effective Lagrangian for pro- 
ton decay arises from a set of tree-level X-boson exchange diagrams. 

The gauge couplings of the 5* + 10 fermions (y', ,;) can be worked out, 
as usual, from their covariant derivatives. With gauge bosons in the matrix 
form A of (14.22) we have 


gyyt Aww + Tr gxy"{A,, X} = — Vr gW, iy" e + āū,” d,) 
+ VigX aale tS "dpa 
+ 6(qyre* — hyt di] +... (14.62) 


where all the fermions are left-handed and g is the SU(5) gauge coupling 
constant. The SU(2) doublets are given the labels 


Xia = (X,, Y,) (14.63) 
daa = (U,, da) (14.64) 
ba Vee) (14.65) 


and ¢,,, and £a are the totally antisymmetric tensors. Once again it should be 
remembered that here we are working in the one-family approximation and 
when more families of fermions (all in the 5* + 10 representations) are 
incorporated, the fermion fields of eqns (14.62), (14.64), and (14.65) will all 
be replaced by suitable linear combinations of fermions having the same 
SU(3) x SU(2) x U(1) charges. A more detailed discussion of mixing angles 
and CP-violation phases will be postponed until the next section; for the 
present it suffices to note the expected presence of such mixing angles and 
phases. 

It is important to notice that X-bosons couple to two-fermion channels 
with different baryon numbers (see Fig. 14.4). In one case they couple to 
quarks and leptons (B, = 1/3)—as such they are referred to as lepto- 
quarks; in the other case they transform quarks to antiquarks (B, = 2/3)— 
they are then called diquarks. Consequently, through the mediation of X- 
bosons, a B = —1/3 channel can be converted into a B = 2/3 one; i.e. a 


q B,=2/3 q€ 
Fic. 14.4. X bosons as leptoquarks and diquarks. 
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Fic. 14.5. Baryon-number violating processes in the lowest-order X- and Y-boson exchange 
diagrams. 


baryon-number changing (AB = 1) process at tree-level occurs (shown in 
Fig. 14.5). 

Since M, is large (compared to all the fermions), we can write down the 
effective four-fermion local interactions from (14.62) 


L np=i = (g7/2M xe"? USYA pa)(daY ule T Ey dap): (14.66) 
We note the following features of this AB = 1 effective Lagrangian density. 


(1) A(B — L)=0. Baryon minus lepton number is conserved; thus 
p > etn? is allowed, but not n >e 2", etc. 

(2) SU(3) x SU(2) x U(1) invariance. All colour and flavour indices are 
contracted and electric charge is conserved in (14.66). This must be the case 
in view of our taking into account only one mass scale (i.e. Mx) and 
effectively treating all other particles including W and Z as massless. 


This suggests that, independently of the validity of the SU(5) model, 
constructing the most general form of the dominant AB = 1 amplitude 
involves listing all the lowest-dimensional SU(3) x SU(2) x U(1) symmetric 
operators (Weinberg 1979d; Wilczek and Zee 1979a). The lowest dimension 
is six as in (14.66) and a higher dimensional operator will necessarily be 
suppressed by extra powers of M, in the denominator. Thus the procedure is 
just an extension of that used by Fermi for the B-decay effective amplitude 
where only a global U(1) charge conservation needed to be imposed. It turns 
out that, if we restrict ourselves to fermions with the familiar SU(3) x 
SU(2) x U(1) quantum numbers as in (14.4), the complete list of such 
dimension-six operators is rather small. (Many of them are related by Fierz 
rearrangement.) When such a list is examined, one finds that the selection 
rule A(B — L) = 0 is still respected by this general amplitude. In fact, as we 
shall see in the next section, in the minimal version of SU(5) with a Higgs 
multiplet in the 5 + 24 representation, B — L is an exact global symmetry 
of the model; hence it holds to all orders of perturbation. 

We also make the parenthetical remark that the four-fermion effective 
Lagrangian in (14.66) contains a AS < 0 rule for proton decay. When the 
second family of fermions is put in, the final d° state in Fig. 14.5 will be 
replaced by Cabibbo-rotated d° and s° states. Thus p > e*K° is allowed, 
but not p > e+K°, etc. This selection rule is merely a consequence of the 
simple quark model because the lowest-dimension AB = 1 operator has three 
quark fields and the strange quark has S = —1. 
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To obtain the proton life-time and branching ratios from #y,_, of (14.66) 
with the mixing angles of the minimal SU(5) we still need to perform two 
more sets of calculations. 


(A) The effective Lagrangian is written down at the mass scale of Mx and 
it must be renormalized down to the typical hadron scale of O(1 GeV). The 
leading logarithmic radiative corrections to Zag=; due to the exchanges of 
SU(3) x SU(2) x U(1) gauge bosons can be estimated using the standard 
technique of the renormalization group discussed in Chapter 3. (An example 
of such calculations can be found in the next section.) This yields an 
amplitude enhancement factor of about 4 (Buras et al. 1978). 

(B) Proton or bound-neutron decay can be viewed as arising from the 
four-fermion local interaction of (14.66) with the final antiquark combining 
with a spectator quark from the initial nucleon to form the final meson 
system. A variety of phenomenological hadron-physics techniques: SU(6), 
relativistic bag models, etc. have been used to evaluate the matrix element 
Lyg-, between the hadron states. (For a review, see Langacker 1981.) 
Unfortunately there is considerable variation in the results so obtained—by a 
factor of about 5 in the amplitude. However most model calculations 
indicate that the minimal SU(5) would have substantial nucleon decay into 
two- or quasi-two-body channels, which should be relatively easy to detect 


p > e*n’, ea, vn", etc. 
n > e*n”, Vn’, e*p7, etc. 
The prediction of proton lifetime ty is clearly very sensitive to uncertainties 
in our calculations of Mx since ty oc M$. As Mx is directly related to the 
QCD scale parameter A, the value of which is still controversial, we should 


thus keep in mind that the oft-quoted SU(5) value ty ~ 10°° years is 
probably uncertain by a multiplicative factor of 10*?. 


Baryon number asymmetry in the universe 


In the remainder of this section we shall discuss briefly how a GUT, which 
predicts proton decay, may also explain why the universe does not seem to 
contain a large concentration of antimatter (Yoshimura 1978; Toussaint, 
Treiman, Wilczek, and Zee 1979; Dimopoulos and Susskind 1978; Weinberg 
1979c). This cosmological asymmetry between matter (baryons) and anti- 
matter (antibaryons) is a puzzle even at the SU(3) x SU(2) x U(1) level. No 
cosmology model can generate a net baryon number if all underlying physical 
processes conserve baryon number. Until the advent of GUT one had to 
impose on the standard model of cosmology an ad hoc asymmetric initial 
boundary condition (see eqn (14.68) below). This appears to many to be an 
unsatisfactory feature of the theory. 

We have already mentioned in §13.2 in connection with our discussion on 
the cosmological bound on neutrino masses how the standard model of 
cosmology provides us with a very satisfactory picture that can account fora 
variety of observational data. In particular the observed 2.7 K degree of 
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background black-body radiation is compatible with the nucleosynthesis 
calculation of the primordial helium abundance. 

However the standard model with only baryon-number conserving inter- 
actions does not fix the ratio of the photon number density n, (corresponding 
to the value at 2.7 K) to the observed nucleon density ny. We must put in by 
hand, as an initial condition, the value 

PN ~ 1079. (14.67) 
Ay 
When the universe was hot enough that baryons (quarks) and antibaryons 
(antiquarks) could be freely pair-created by radiation the above ratio implied 
the baryon-number asymmetry 
gt 9 (14.68) 
Ng + Ng 
where n, and ng are the quark and anti-quark number densities, respectively. 

Why should there be this asymmetry, with this particular value? It would 
be much more satisfying if starting with a symmetric state (or better, 
independent of initial conditions) such an asymmetry could be generated by 
the underlying physical interactions. To have such a situation we must 
postulate a new particle interaction beyond those given by the SU(3) x 
SU(2) x U(1) model. Besides being a baryon-number changing interaction, 
the new interaction must have the following general properties: (i) it must 
violate C and CP conservation; (ii) it must also have the feature that there 
was a certain period during the cosmological expansion when these B-, C-, 
and CP-violating processes were out of thermal equilibrium (Sakharov 
1967). It is clear that charge conjugation and CP symmetries would 
automatically preclude a nonzero value for ô in (14.68) since these operations 
interchange n, and n,. The need for nonequilibrium is perhaps less obvious. 
Heuristically we can understand this by recalling that CPT invariance 
requires particle and antiparticle states to have the same mass, hence to be 
equally weighted in the Boltzmann distribution; thus no CPT-invariant 
interactions can generate a nonzero ô in thermal equilibrium. 

GUTs, such as SU(5) models, have just the required properties to generate 
a nonzero ô. There is a set of B-, C-, and CP-violating processes, related to 
the X-boson couplings (and also to those to the Higgs particles), that can be 
forced out of equilibrium by the cosmological expansion. 

To see how this can happen, we first need to calculate the various reaction 
rates as a function of energy (i.e. temperature). The criterion of thermal 
equilibrium is that the reaction rate be greater than the expansion rate of the 
universe. It turns out that the two-body collisions of Fig. 14.5 do not have the 
required equilibrium—nonequilibrium transition. However the decays and 
inverse decays of heavy X boson do have this transition because they have a 
threshold. For kT > Mx, the X bosons would exist in thermal equilibrium 
with an abundance comparable to ordinary particles (e.g. Ny = N,). This 
mixture of X and X° undergoes B and CP violating decays. This preferentially 
produces quarks relative to antiquarks (see discussions below). Normally 
such a net baryon number would have eventually been ‘washed off’ by 
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inverse decays. However as the universe cools below My (i.e. kT < Mx) the 
number of X bosons (and the inverse decays) are suppressed by the 
Boltzmann factor exp(—M,/kT). The baryon number production is essen- 
tially shut off and the net baryon number produced in the earlier stage is 
‘frozen in’. 

As we have already pointed out, there are two classes of X decay channels 
with different baryon numbers: B, = —1/3 and B, = 2/3. Thus we have the 
following four decay rates for X and its antiparticle X°. 


1 =T(X> lq) with By = —1/ (14.69) 

y2 =I(X >qq) with B, = 2/3 (14.70) 
and 

yi =T(X' >q) with Bi = 1/3 (14.71) 

yi =r(X > qq) with B} = —2/3. (14.72) 


CPT invariance requires that the total rates be the same for particles and 
antiparticles, 1.e. 


Yi + Y2 = V1 + Y2- (14.73) 


However CPT demands that y; = y3 and y,=y5 only in the Born 
approximation. With C- and CP-violation couplings, higher-order inter- 
ference terms such as those shown in Fig. 14.6 can lead to 


Yı — y1 =V — 72 #0 (14.74) 


Consequently, starting with an equal mixture of X and X° their decay 
products can show a net baryon number in an nonequilibrium situation 


60 Y1ıBı + Y2B2 + Y1B1 + y2B2 = (yı — y1)(Bı — B2). (14.75) 


This explicitly displays the B, C, and CP nonconserving nature of ô, which 
should be small because (y; — y{) is necessarily a higher-order term (and 
multiplied possibly by some small CP phases). In fact in the minimal SU(5) 
model (see the next section) y, — y$ receives its first nontrivial contribution 
at the 10th-order of perturbation leading to a 6 many orders smaller than 
the observed asymmetry of 107°. A number of more complicated models 
have been proposed to remedy this. Further development in this direction is 
perhaps premature, since the quantitative calculation of 6 at this stage is still 
uncertain. What is clear is that GUTs such as the SU(5) models have the 
qualitatively correct features that can, in the context of the standard model of 
cosmology, provide a natural explanation for the observed baryon-number 
asymmetry in the universe. 


Fic. 14.6. y; — y5 may receive a nonzero contribution from this fourth-order interference effect. 
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14.5 Fermion masses and mixing angles in the minimal 
SU(5) model 


From our study of the Weinberg—Salam model in §11.3 we have learned that, 
when there are several particles of the same charge, the eigenstates of their 
mass matrix are generally different from those of fields having definite gauge- 
interaction quantum numbers. The mass eigenstates are related to the gauge- 
interaction eigenstates by some unitary transformation. 

In this section all three 5* + 10 fermion families will be included in the 
theory. The fermion fields used in previous sections of this chapter should be 
replaced by gauge eigenstates, which are vectors in a three-dimensional space 
of the family index A = e, p, T. 


e > e4 = 4g €B eg = (e, p, T) 
peas i 
Ve > V4 = 1 4BVB Vg = (V1, V2, V3) 


u > py = Ulep Pg = (u, C, t) 
d > n4 = Vian, ng = (d, s, b). (14.76) 


We have chosen the basis such that charged lepton-gauge eigenstates e} are 
the same as their mass eigenstates e,. For massless neutrinos any linear 
combination of the degenerate fields can be taken as their mass eigenstates; 
hence we can set the unitary matrix T, to be 64, also. Since the mass 
matrices are not necessarily symmetric, the unitary transformations U,, and 
Vg may be different for the left-handed (LH) and right-handed (RH) 
fermion fields. With our convention of always working with LH fermions in 
this chapter it means that we should distinguish between transformations for 
particle and antiparticles. Thus to (14.76) we should add 


u > py = Usps; ph = (uS, c3, t°) (14.77) 
d > nj = Vins nz = (d5, s‘, b°) (14.78) 
and generally U,, Æ US, and Vig 4 Vẹp. 


Kinship hypothesis 


While the ‘family’ is a well-defined notion in terms of gauge eigenstates, 
historically we also have an intuitively sensible grouping of the fermion mass 
eigenstates 


e — family: (e, v,, d, u) 
u — family: (M, v2, S, c) 
t — family: (t, v3, b, t) (14.79) 


with each family ascending on the mass scale. Furthermore this grouping 
scheme is supported by our experience with the weak interaction in the sense 
that charged-current transitions between different families are suppressed by 
small mixing angles (eqn (12.47)). The question naturally arises whether this 
feature can be generalized in a GUT in which there are new flavour-changing 
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currents coupled to the X gauge bosons. Is the family structure (14.79) 
observed at low energies still valid in the SU(5) models? To focus the issue, 
this generalization is called the ‘kinship hypothesis’. Namely, it supposes that 
all interfamily transitions are suppressed by the appropriate small mixing 
angles. For example the kinship hypothesis says that the baryon-number 
changing couplings u > t* or b° are suppressed relative to u > e* or d‘, etc. 
This of course would have important implications for the analysis of proton 
decay. A strong violation of this hypothesis would severely depress the decay 
rate and sharply alter its branching ratio pattern. 

The SU(5) gauge couplings in eqn (14.62) can be transcribed in terms of 
gauge eigenstates according to eqns (14.76), (14.77) and (14.78) 


Woven + Pans) + X(papy + yey” + €4n7) 
+ Y(pyn), + paet + Vani). (14.80) 
We have dropped all extraneous labels except the family index A. Expressing 
these couplings in terms of mass eigenstates we have 


Wiv,e, + p,(UV'") ne] + X[pi(USU") sePs + iy(V)sse3 + eV Japa] 


+ Y[pi(UV") ang + PU) sea + Val V") aN). 
(14.81) 
The KM rotation matrix of (11.124) is just the combination (UV’). It is 
clear that all the six other rotation matrices in the X, Y gauge boson 
couplings are not of this form. Hence in principle we may encounter very 
different mixing angles in these new interaction vertices; the kinship 
hypothesis may not be valid. However we shall display below that in the 
version of SU(5) with the simplest possible Higgs structure, the minimal 
SU(5) model, these new mixings all essentially collapse to the familiar KM 
rotation. 
Given that the fermions are in the 5* + 10 representation, the scalars that 
can form Yukawa couplings must be 


5* x 5* = 10* + 15 
10 x 10 = 5* + 45* + 50 
5* x 10=5+ 45. (14.82) 


It is a straightforward exercise to check that only 5 and 45 have 
components transforming as (1,2) under the subgroup SU(3) x SU(2). 
The minimal SU(5) model has Higgs scalars only in 24 and 5, with 5 
providing the breaking of SU(3) x SU(2) x U(1) ~ SU(3) x U(1) and 
giving masses to fermions. 

We have the following Yukawa couplings 


f Ray) Cep me +J Ray) Co" + h.c. (14.83) 


where C is the Dirac charge conjugation matrix. It then follows from the 
auticommutation of the fermion fields and the antisymmetric property of 
C =iy°y? that the Yukawa coupling matrix f} is symmetric. When 
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the 5 scalar @ develops VEV as in (14.34) 


<> = (0, 0, 0, 0, v2), (14.84) 
the couplings in (14.83) produce terms quadratic in fermion fields 
Vo f GAD aPs) + V2 SGAN + E1468). (14.85) 


The mass matrices for p, n, and e states then have the properties 
vfus = Mh = MYA 
va fG = Mis = MGB. (14.86) 


The fact that the p-quark mass matrix is symmetric implies that UŻU" is a 
diagonal unitary matrix (see (11.109)). The equality of M” = M® is a 
consequence of the SU(4) symmetry of VEV in (14.84). This implies that the 
same biunitary transformations diagonalize both M™ and M. Since we 
have chosen gauge eigenstates and mass eigenstates to be the same for 
charged leptons, it then follows that we can take V and V° to be identity 
transformations also. Thus up to a complex unit matrix all the mixing angle 
matrices in (14.81) are of the form (UV), which is just the KM rotation of 
weak interactions. For the minimal SU(5) model the kinship hypothesis is 
fully realized. 


Lepton—quark mass relations 


Is there any evidence in support of the minimal version of the SU(5) theory? 

The equality of M" = M® implies not only the equality of the diagonal- 
ization matrices but also of their eigenvalues. Hence we have (Georgi and 
Glashow 1974) 


Me = M4 
m, = m, 
m, = M. (14.87) 


Again, like the coupling-constant equality of (14.37), there are SU(5) 
symmetric relations, subject to significant renormalization corrections. 
Perturbation calculation of fermion self-energy yields 


m(u) = m — mg2b™ In(A/p). (14.88) 
Thus the renormalization group equation for the effective mass is 
d In m(u) 
——— = bmg? (p). 14. 
Ting T Pe (14.89) 


This differential equation can be integrated since we know from eqn (10.80) 
the scale dependence of the coupling constant 


g4 (Uo) 


Inl) = 1 + 26,97 (Uo) ln(u/Ho) 


(14.90) 
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which is of course the solution to eqn (14.42) 


d g(u) 
oe = —2b,g8(u). (14.91) 
We obtain the result 
mu) ae hea 
M(Uo) Jn(Ho) 


where the b,s are given in eqn (14.43). We now proceed to calculate b” 
using Fig. 14.7 


3 
bm” — E= Y (T°T");; (14.93) 
L L R R 


Fic. 14.7. Self-energy diagram for fermions. 


where the 7’s are the representation matrices appropriate for the fermions. 
For SU(n) with n > 2, 


ae n? — | 
and, for the U(1) case (eqn (14.40)), 
37 YN 
ON 2 ce : 
= 5(3) (14.95) 
Using the expression for b, in eqns (14.43) and (14.44), we obtain 
m (u) _ [ae p= | a z (14.96) 
M,( Lo) 93(Uo) 91 (Uo) 
mM,(Uo) 93(Ho) 91(Ug) 
m,() ae lia 
z F 14.98 
m,(Ho) 91(Uo) 


where Np is the number of quark flavours. We note that there is no 
contribution from SU(2) gauge bosons because RH fermions are all singlet 
under SU(2) and the diagram in Fig. 14.7 involves one helicity flip. Dividing 
eqn (14.97) by eqn (14.98) and using eqn (14.87) for (m, (Mx) = m(M,x)), we 
obtain (with g3(Mx) = 9;(Mx) = 95(Mx)) 


m,(u) = Pra Trey | a fe (14.99) 
m.(L) 95(Mx) 95(Mx) 
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If we take the effective current quark mass at the qq threshold 


H = My = 2m, (Mn) (14.100) 


we then obtain for n, = b, e} = t, and u œ 10 GeV 


ee (14.101) 
mM, 
which must be regarded as a successful prediction of the theory (Buras et al. 
1978). 

It is not clear how to properly evaluate the renormalization effects for 
lighter fermions since a smaller value of scale parameter u must be involved. 
However, if we merely examine the renormalization-group invariant ratio as 
implied by eqn (14.87) 
+. —, (14.102) 

Me Ma 
we see that the two sides differ by something like a factor of 10, since current- 
algebra calculations indicate that m,/m, ~ 20. Does this ‘failure’ definitely 
preclude the minimal SU(5) as a viable theory? Such a strong conclusion is 
perhaps unwarranted because, for the very light quark d (ma ~ 7 MeV), the 
renormalization group equation perturbative calculation is not expected to hold. 
Thus the correctness of the minimal SU(5) model is still an open question. 


B — L conservation 


We close this section with a remark about another aspect of the minimal 
SU(S) model: it still possesses a global U(1) symmetry, corresponding to the 
conservation of B—L, baryon minus lepton number (Wilczek and Zee 
1979b). It is clear that the X, Y gauge bosons conserve B — L if we assign 
them the quantum number of B— L = 2/3. We need to check that the 
Yukawa couplings of (14.83) have this symmetry which is not spoiled by the 
spontaneous symmetry breaking of (14.84). 
We can write (14.83) symbolically as 


f( 10, )(10-)(5g) + £'(10-)(5*)(5¥). (14.103) 


It is easy to check the conservation of a new charge (call it F). The first term 
requires 


F(5,) = —2F(10,); (14.104) 
then the second term implies 
F(5,) = 3F(10,). (14.105) 


This means that the conserved U(1) charge may be written in terms of 
number operators 


F = (1/5)[N(10,) + 3M(5,) — 2N (54)]. (14.106) 


The normalization factor 1/5 is a convention. It is clear that <> of (14.84) is 
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not invariant under the U(1) generated by the F-charge of (14.106) nor under 
the U(1) generated by the weak hypercharge as neither 

l 


F(54) = —2/5 l 


and Y(54) = 29/3 (14.107) 


annihilate the vector in (14.84). However the linear combination F’ = F + 
¿Y has the property that F’(5,)(¢) =0 and is thus still conserved after 
SSB. We note that F’ is no other than B — L since 


1/3 
1/3 
F'(5,) = 1/3 : (14.108) 


1 


Just as B and L are accidental global symmetries in the standard SU(3) x 
SU(2) x U(1) model, the minimal SU(5) violates B and L conservation 
but still preserves the combination B — L. 


15 Magnetic monopoles 


The modern theory of magnetic charges was first formulated by Dirac over 
50 years ago. Their existence has been under active experimental investiga- 
tion ever since. An intriguing aspect of non-Abelian gauge theory is that it 
has objects with the properties of magnetic monopoles (t Hooft 1974; 
Polyakov 1974). It is expected that monopoles associated with the spon- 
taneous symmetry breakdown of grand unified gauge theories should be 
superheavy (~ 101° GeV?) and such objects may well have escaped detec- 
tion. The recent upsurge of an extensive search for magnetic monopoles has 
been prompted further by the possible evidence reported by Cabrera (1982). 
However, even if this first sighting 1s not confirmed by other experiments, the 
theoretical studies of monopoles should lead to a better understanding of the 
structure of non-Abelian gauge theories with spontaneous symmetry 
breaking. 

In this chapter we shall present a brief introduction to the gauge theory of 
magnetic monopoles. In §15.1 we review the properties of the monopoles as 
originally proposed by Dirac (1931). We then discuss in §15.2 the general 
features of finite-energy solutions (solitons) to the equation of motion in field 
theory. In §15.3 we will illustrate some aspects of the °t Hooft—Polyakov 
monopole solution in non-Abelian gauge theory, which is a synthesis of the 
Dirac monopole and the soliton solution. It should be emphasized that our 
discussion will be kept at an elementary level and mathematical rigour is 
often sacrificed for simplicity of presentation. For excellent reviews on this 
subject the reader is referred to articles by Goddard and Olive (1978) and by 
Coleman (1975, 1981). 


15.1 Dirac’s theory of magnetic poles 


In this section, we will review the properties of the magnetic monopole as 
originally introduced by Dirac. (For a more detailed discussion, see Goddard 
and Olive 1978.) 


Classical electromagnetism and duality transformations 


We start from classical electromagnetism which is well described by 
Maxwell’s equations 


V-E=p VxB-0E=j (15.1) 
VB=0 VxE+ôB=0. (15.2) 


In terms of the electromagnetic field tensor F", these equations can be 
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combined into covariant form, 


Ore = — j" (15.3) 
OF = 0 (15.4) 

where 
ji'=(9,j),  F*=E, FY = —eikBk (15.5) 


and the dual field tensor is defined as 
Puy = Sete F g. (15.6) 


In vacuum, where j, =0, Maxwells equations (15.3) and (15.4) are 
symmetric under the duality transformation 


Fey, Pe, PP o F” (15.7) 


which corresponds to the interchange of electricity and magnetism E — B, 
B — —E. This symmetry is broken by the presence of the electric current j, in 
eqn (15.3). One can introduce the magnetic current k“ = (ø, k) on the right- 
hand side of (15.4) so that the modified Maxwell’s equations read 

Od ay ð F” = — k". (15.8) 


u 


They will be symmetric under the duality transformation (15.7), sup- 
plemented by the substitution 


jt a kë, kh > =j". (15.9) 


That is (F, F), (E, B) and (j, k) are ‘duality vectors’ and the transformation is 
a rotation (by 90°) in such two-dimensional planes. (These duality rotations 
will be useful in translating the familiar electromagnetic results involving 
electric charges into those involving magnetic charges.) The introduction of 
the magnetic current k” requires the existence of magnetically charged 
particles, the magnetic monopoles. In analogy to the electric current produced 
by point particles at x; with charge q; 


JAX) = 2 qi [ax 6*(x — x,), (15.10a) 


the magnetic current due to point magnetic charges with strength g; is given 
by 


k(x) = 2 gi fax ôt(x — x;) (15.10b) 


where the line integrals in (15.10a) and (15.10b) are taken along the particle 
world lines. 


Consistency of the monopole with quantum theory 


On the classical level, the magnetic monopole seems to be on the same 
footing as the electrically charged particle and it is mysterious that 
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monopoles are not seen in the laboratory. It was first pointed out by Dirac 
that on the quantum level the existence of the monopole will lead to the 
condition 


Nl 


n where n is an integer. (15.11) 


JG 


This is the famous Dirac quantization condition which implies charge 
quantization, i.e. the possible value of the electric charge carried by any 
particle is an integral multiple of some basic unit (cf. discussion in §14.1). 
This can be seen as follows. Consider the case where particles may carry 
either electric or magnetic charge but not both. The possible values of the 
electric and magnetic charges are denoted by q; and g;, respectively. The 
Dirac quantization condition (15.11) implies that 


where n;; is an integer. Then for any fixed magnetic charge g,, all electric 
charge q; must be an integral multiple of 27/g,;. Similarly, for any fixed 
electric charge q;, all magnetic charge must also be multiple of 27/q;. If we 
assume that there exists a smallest electric charge gp and a smallest magnetic 
charge gp, we have 


qi = Mido, (15.12) 
Ji =NGo> (15.13) 
and 
dogo |l 
„n 15.14 
4r 2 K ( ) 


where n;, n;, and no are integers. Condition (15.14) also implies that the 
interaction between two magnetic monopoles which is proportional to 


nz (An \? 
g ~q <|>) ~ (no/20)*48 (15.15) 
4 do 


is roughly « * ~ 104 times stronger than the interactions between the 
electrically charged particles. In other words, the smallness of the electric 
charge coupling g2/4x ~ a ~ 1/137 implies a very strong interaction between 
monopoles, and it will be much more difficult to pair-produce the magnetic 
monopoles than the electrically charged particles. 


Angular momentum and the Dirac quantization condition 


One heuristic way to ‘derive’ the Dirac quantization condition (15.11) is to 

study the motion of a charged particle in the field of a monopole. The 

magnetic field due to a monopole of strength g fixed at the origin is given by 
g A 


=z a1 (15.16) 
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where f is the unit vector in the radial direction. The motion of a particle with 
mass m and electric charge q in this field is given by 


mi = git x B (15.17) 


We can calculate the rate of change of its orbital angular momentum, 


d e oe qg ° d qg A 
= x — = = —— e 
fF (r x mr) =r x mr ae rx(rxr J7 (£ f 
This suggests that we can define the total angular momentum as 


J=rxm -£e (15.18) 
An 


so that it is conserved. The second term in eqn (15.18) can be interpreted as 


the angular momentum due to the electromagnetic field, because using 
(15.16) we have 


Len = [ae x (E x B) 


_ FI | aw. Ee = L3 
-2 fa x(V- E) An 


Thus the conservation of the total angular momentum J means that angular 
momentum is passed back and forth between the particle and the field in the 
presence of electric and magnetic charges. When we quantize the theory we 
would expect the components of J to satisfy the usual angular momentum 
commutation relations. This would imply that the eigenvalues of J; are half- 
integers. Since we expect the orbital angular momentum, the first term in 
(15.18), to have integral eigenvalues, we get 
qg l 


iG (n = integers) 


which is just the Dirac quantization condition of eqn (15.11). 


The Dirac string 


A rigorous derivation of the Dirac quantization condition comes from 
considering the quantization of the motion of a particle in a given 
electromagnetic field. In the usual quantization of the electromagnetic field in 
the absence of the monopole, we write the electromagnetic field tensor F,, in 
terms of the four-vector potential A, = (@, A) as 


F,, = 0,4, — 0,A, 
or 


B=VxA, E=-Vd-— (15.19) 


so that the equation aË, = 0 is automatically satisfied. Then Schrödinger’s 
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equation for a particle moving in the electromagnetic field has the form 


od, ay 
È Cee e i (15.20) 


which is invariant under the gauge transformation 
1 
A(x) > A(x) + : Va(x) 


W(x) > y(x) 


with a(x) being an arbitrary function. Thus the four-vector potential A, 
plays a crucial role as the basic dynamical variable in the quantization. 
But, if the monopole exists, the vector potential cannot exist everywhere 
because F¥” satisfies eqn (15.8) rather than eqn (15.4). Does quantum 
mechanics preclude the existence of magnetic monopole? Dirac overcame 
this difficulty by introducing the concept of a string. To see this, consider 
the magnetic field of the monopole given in eqn (15.16). For any closed 
surface containing the origin, we have 


j= pB-ds (15.21) 
S 

It is clear that B cannot be written as V x A everywhere; otherwise the 
integral in (15.21) would be zero. However, we can define an A such that B is 
given by V x A everywhere except on a line joining the origin to infinity. To 
see that this is possible, consider the field due to an infinitely long and thin 
solenoid placed along the negative z-axis with its positive pole (with strength 
g) at the origin. Its magnetic field would be 


B = —— f + gO(—z) 6(x) ô) (15.22) 
Anr 
where Z is a unit vector in the z-direction. This magnetic field differs from the 
field of the magnetic monopole (eqn (15.16)) by the singular magnetic flux 
along the solenoid, the second term in (15.22). Since the magnetic field given 
in (15.22) is source-free (V - B,,, = 0), we can write 


Boa = V x A. (15.23) 
Then from eqns (15.16), (15.22), and (15.23), the monopole field is given by 


2 


g 
~ Anr? 
with a pictorial representation shown in Fig. 15.1. The line occupied by the 
solenoid is called the Dirac string. It is not difficult to see that the vector 
potential A of the solenoid can be written, with the conventional definitions 
of polar and azimuthal angles, as 


I Ea 2 °) ô (15.25) 


~ Anr sin 0 


B r= V x A — g0(— z) (x) 6(y)z (15.24) 
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which is singular on the negative z-axis. Eqn (15.24) means that the 
monopole field can be represented by a vector potential A together with a 


string. 
X l X ’ 
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It should be emphasized that the Dirac string is not observable. For 
example, the Dirac string does not give rise to the Aharonov-Bohm effect 
(1959), which is the quantum-mechanical interference of charged particles in 
a region with B = 0 but A # 0. The condition for the absence of Aharonov— 
Bohm effect is that for a charged particle (q) in the two-slit experiment reach- 
ing the detector screen via two distinct paths labelled by 1 and 2 around 
the Dirac string is that 


exp (i fa a)y: + exp( i [acai 


This is precisely the Dirac quantization condition (15.11): 


2 


= |W. + WI’. 


qg = q d Adl = 2am. 


where the closed path consists of path 1 and (the reverse of) path 2. 

Another way to see that the Dirac string is unobservable is to show that it 
can be moved around by a suitable gauge transformation. We demonstrate this 
as follows. The vector potential given in (15.25) is not unique for the solenoid 
field. If one makes a non-singular gauge transformation A > A + Vy where 
y is a non-singular, single-valued function of position, then the V x A term 
in eqn (15.24) will remain unchanged and so must the Dirac string term. To 
move the string we need to extend the concept of the gauge transformation. 
Rewrite eqn (15.24) for the magnetic field of the monopole as 


Bir) =V x A+h(G,r) (15.26) 


where h(%, r) represents the contribution of the Dirac string along some curve 
@ going from the origin to oo and has a flux of strength g, 


h(@,r)=g | ox 6°(r — x). (15.27) 


Consider another string @’ running from the origin to œ along curve @’. 
Let T denote the curve —@’ (@’ taken in the reverse direction) followed by @. 
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We may treat this as a closed path, either by making suitable assumptions 
about what happens at infinity or by assuming that @’ differs from @ only 
over a finite range (see Fig. 15.2). Let Q(r) be the solid angle subtended at r by 
some particular surface spanning I’. Various choices of the spanning surface 


FIG. 15.2. 


will lead to values of Q differing by multiples of 47, but will yield the same 
value for VQ. Now consider the extended gauge transformation defined by 


ik Sp R= Ae 220: (15.28) 
4r 


Note that Q(r) is a multi-valued function and is ill-defined for r on I. Then 
V x A' = V x A = B except on the two strings. Applying Stokes’ theorem to 
a small loop encircling any point on I we see that the flux of V x (A’ — A) 
along I is 


[v x (A' — A)-do = ia’ —a)-at = £ pvas g ban=g 
4r 4r 
: 6, 6, 


oO; 6, 


(15.29) 


where we have used eqn (15.28). From eqn (15.27), we can write eqn (15.29) 
as 
V x (A’ — A) = h(@,r) — h(@r) 
or 
B=VxA+h(@,r)=V x A’+h(@,r). (15.30) 
This shows that the gauge transformation (15.28) moves the Dirac string 


around. The arbitrary position of the string shows that it is unphysical or, as 
one says, the Dirac string is a gauge artefact. 


Derivation of the Dirac quantization condition 


Eqn (15.11) follows from the crucial requirement that the generalized gauge 
transformation (15.28) acting on the wavefunction of the particle must be 
given an equivalent quantum-mechanical description. Thus, under the gauge 
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transformation (15.28), the change in the wave function 
yoy = exp( -i 4 ay (15.31) 


should not produce a multi-valued result. Since there is an ambiguity of 4r in 
Q(r), we must have 
qg = 2nn 


which is just Dirac’s quantization condition (15.11). 


Comment 1 To make the above derivation more transparent it will be 
helpful to work out the example of vector potential A and A’ for strings @ and 
¢@' being the negative and positive z-axis respectively. We then have the explicit 
result from eqn (15.25): 


A-dr = “(1 — cos 0) dọ 
An 


A'-dr = —2 (1 + cos 0) dd. 
4n 
From their difference one can immediately verify the gauge function as 
g g 
= — (2¢) = —Q. 
5 4n (ap) 4n 


Comment 2 We can also avoid the notion of singular potentials and gauge 
transformations by using different monopole potentials in different regions 
of space, namely A and A’ defined over domains with @ and @’ (€.g. negative 
and positive z-axis) respectively excluded. The Dirac quantization condition 
then follows from the requirement that in the overlapping region of the two 
domains the potentials A and A’ should be connected by a single-valued 
gauge transformation (Wu and Yang 1976). While this is an elegant 
formulation, in most practical calculations it is still more convenient to use 
the singular Dirac string potential. 


15.2 Solitons in field theory 


To prepare for discussion of the monopole in non-Abelian gauge theory, we 
will first present an elementary introduction to the classical finite-energy 
solutions of field theory, generally called solitons. (For further discussion, see 
Coleman 1975; Rajaraman 1975.) We will illustrate this by taking as an 
example A@* theory in one space and one time dimension. The Lagrangian is 
given by 


L= | op) — 2p) — V(p)] dx (15.32) 


where 


A 
Vib) =5 (6? - ay (15.33) 
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and 
aè = p*/d. 


The Hamiltonian is given by 


H = [Bew + 3(6,6)° + V(p)] dx. (15.34) 


As we discussed in §5.3, the classical ground-state configuration for the case 
2 e 
u“ > 01s 


$= +a= +e (15.35) 


and the ground-state energy is E = 0. An interesting feature of this model is 
that there exists a static (time-independent) finite-energy solution to the 
equation of motion (solitons). The time-independent solution to the equation 
of motion can be obtained from the Lagrangian L through the variational 
principle, 


=6L226 | dx[4 p) + V()] = 0 (15.36) 


Mathematically, this is equivalent to the problem of motion of a particle of 
unit mass in a potential — V(x), where the equation of motion is derived from 


' 1 (dx\? 
ô faz = 0 fad; (=) + ro | = 0. (15.37) 


Every motion of the particle in the potential — V(x) corresponds to a time- 
independent solution of the field equation. However, not all of these 
solutions are of finite energy. To get a finite-energy solution, we must require 
@ to go to a zero of V(d) as x > +œ, so that the energy integral in eqn 
(15.34) is finite. In the particle problem, this corresponds to the condition 
that the particle must go to the zeros of the potential as t + + œ. Of course, 
the ground states where the particle sits at x = a or —a for all times will 
satisfy this requirement, but there are also non-trivial motions which also 
satisfy this requirement. Finiteness of energy requires the solution to take on 
the vacuum value (+a) at t = +00 but since we have a system of degenerate 
vacua the solution may take on different minima (+a or —a) at different 
infinity points (+ œ or — oo). Thus for example there are motions where the 
particle starts on top of one hill and moves to the top of the other and has 
zero energy (Fig. 15.3). We can use this property of zero-energy motion to 


—V(g) 


Fic. 15.3. 


462 Magnetic monopoles 15.2 


find the explicit form of the finite-energy solution to the field-theory case. 
From the energy conservation for the motion of the particle with zero total 
energy we have 


1 (/dx\? 
(3) +E- Vo= 0 


which corresponds to 


$) - Vid) (15.38) 


for the case of field theory. Eqn (15.38) can be solved easily by integration 
and the result is 
o 


x= + | wpro” (15.39) 

$o 
where ġọ is the value of @ at x = 0 and can be any number between a and 
—a. The presence of this arbitrary parameter @ọ is due to the translational 
invariance of eqn (15.38), i.e., if @ = f(x) is a solution, then ọ = f(x — c) is 
also a solution where c is an arbitrary constant. For the case of A¢* theory, 


the potential is given in eqn (15.33) and the finite-energy solutions in eqn 
(15.39) can be written as 


h(x) = a tanh(px) (15.40) 
o_(x) = —a tanh(px). (15.41) 


The @, is usually called the kink and ¢_ the anti-kink. The energy of the 
kink (or anti-kink) can be calculated from eqns (15.40) and (15.34) to give 


E = 43/3, (15.42) 
which is indeed finite. It is clear that as x > +00, d, (or @_) approaches the 
zeros of V(@), i.e. 

og, > ta as x> +0. (15.43) 


This behaviour is illustrated in Fig. 15.4. 
These solutions can be shown to be stable with respect to small 
perturbations even though they are not the absolute minimum of the 


$, (x) 


FIG. 15.4. 
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potential energy V(@) (i.e. @ # +a for all x and t). The physical interest in 
these finite-energy solutions to the equation of motion comes from the fact 
that they resemble a particle, with structure, in the following respects. 


(1) Its energy is concentrated in a finite region of space. This is because 
these solutions @, deviate from the ground-state configuration, ¢ = +a 
(zero energy) only in a small region around the origin. 

(2) It can be made to move with any velocity less than unity. This is due to 
the fact that the equation of motion 1s Lorentz-covariant and we can apply a 
Lorentz boost to obtain a solution with non-zero velocity. 


Topological conservation laws 


The finite-energy solutions in A¢* theory in 1 + 1-dimensions have a rather 
interesting topological property which will make these solutions stable. This 
topological property can be easily generalized to a more complicated theory 
in higher dimensions and is very useful in finding stable finite-energy 
solutions. 

The topological property of the kink (or anti-kink) solution in 1d* theory 
in two-dimensional space-time can be studied as follows. From the finite- 
energy requirement we have at spatial infinities 


(œ) — @(— œ) = n(2a) (15.44) 
where n = 0 corresponds to the ground state, n = 1 to the kink solution, and 
n = —1 to the anti-kink solution. Eqn (15.44) can be written as 

+ œ 
| (0 Q) dx = n(2a). (15.45) 


Thus, if we define the current j, as 

Jax) = Euv Oo, 
the current will be automatically conserved because ¢,,, is antisymmetric. The 
corresponding conserved charge is just eqn (15.45), 


+ 00 


O = | Jo(x) dx = | ô p dx = n(2a). (15.46) 


This implies that the kink number n in (15.44) is a conserved quantum 
number. Thus there is no transition between kink (or anti-kink) solutions 
and ground states and they are stable. This conservation law, usually called 
the topological conservation law (Lubkin 1963), has a different origin from 
the usual Noether conservation laws (such as energy conservation) coming 
from the symmetry of the theory (see §5.1) in that it holds independently 
of the equations of motion. One intuitive way to understand this topological 
conservation law is that in order to convert the kink configuration (ġ, or 
@_) to the ground-state configuration where ġ = a or —a for all x, we have 
to change the value of @ from —a to a by penetrating the barrier around 
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go = 0 over an infinite range of x. This clearly will take an infinite amount 
of energy. 

Thus the topological conservation law (eqn (15.46)) divides the finite- 
energy solutions into many distinct sectors; n = 0 (vacuum), n = 1 (kink), 
and n = —1 (anti-kink), etc. These different sectors can be characterized by 
their topological properties as follows. In two-dimensional space-time, the 
spatial infinities consist of two discrete points +00. Denote this set by S. 
For A¢* theory, the set of minima of the potential given in (15.35) also 
consists of two discrete points +a and will be denoted by Mo, 


Mo = {¢: Vid) = 0}. (15.47) 


The condition that the solution to the equation of motion must be of finite 
energy will imply that the asymptotic values of d(x) must be zeros of V(@), 
1.€. 

lim (x) = ¢6€ Mp. (15.48) 

x> +o 

This condition can be considered as a mapping from points in S to Mo. For 
example, in the ground-state configuration, both +00 are mapped to a (or 
—a), while the kink configuration ġ, maps +œ to +a and — œ to —a. 
These are topologically distinct mappings in the sense that it is impossible to 
continuously deform one mapping to the other. (For further discussion see 
§16.1.) This is the essence of the topological conservation laws. These 
topological properties will be very useful for more complicated theories in 
higher dimensions where explicit solutions are hard to come by. 

To summarize, in A¢* theory in two dimensions, the finite-energy solutions 
to the equation of motion exist with non-trivial topological properties and 
these solutions are stable with respect to decaying into vacuum. It is clear 
that the existence of this type of topological, stable, finite-energy solution 
requires the theory to have degenerate vacua (spontaneous symmetry 
breaking) and non-trivial topological properties. 


Solitons in four dimensions 


So far we have only discussed the finite-energy solution in two-dimensional 
field theory. In the more realistic four-dimensional field theory we shall see 
that, in order to have topologically stable finite-energy solutions for the 
scalar field, long-range fields of magnetic type must be present. This will lead 
naturally to non-Abelian gauge theories. 

First consider the finite-energy solution to a scalar field theory with some 
symmetry G in four dimensions. Write the Lagrangian density 


L = 36,9; — Vid.) C22) 
with 


V(g;) = 0. (15.50) 
Denote by M, the set of values of ¢; = n; which minimize the potential V(@;,), 
Mo = (9: = ni: Vini) = 0}. (15.51) 
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Note that all the ¢,s in Mọ are constant, independent of the space-time 
points x,. We will assume that these values are related by the symmetry 
group G. For example, the points ¢ = +a in the case of Ad* theory are 
related by the symmetry ¢ — —@. The possible directions in which the 
spatial coordinate r can go to infinity are labelled by a unit vector in the 
three-dimensional space, 

S? = {ff = 1}. (15.52) 


It is clear that S? is a sphere in three-dimensional space. We say S? is a two- 
dimensional sphere, or two-sphere. Unlike the two-dimensional case where it 
consits of two discrete points (+ 00), S? is a connected set. This will make the 
topology of the spatial infinities in four-dimensional field theory very 
different from that of two-dimensional field theory. The energy for a given 
field configuration is given by 


H = [eog + 2(VA:) + V(ġ:)]. (15.53) 


The condition that the solution to the equation of motion has finite energy 
implies that as r > œ, d; approaches one of the zeros (minima) of V(¢;), i.e. 


o2(f) = lim O(RR)E Mp. (15.54) 


Note that for the ground-state configuration ¢;° goes to the same value in all 
directions. S? being connected, ¢%° would have to be constant (as it must be 
continuous) if Mo is a discrete set (i.e. G is a discrete symmetry group). Then 
pġ? has the same topology as the vacuum configuration and is topologically 
trivial. To get a topologically non-trivial solution, Mọ must be a manifold 
with non-zero dimension. This requires the symmetry group G to be 
continuous. From eqn (15.53), the energy is bounded from below by 


H > OERA + V(9;)]. (15.55) 

Write the gradient term (Vd)? as the sum of a radial and a transverse term 
2_(9¢\ . « 2 

(Yọ) = Be + (Ff x Voy. (15.56) 


Since at infinity, 7° is a function of direction f only, if 7 is not a constant, 
the second term in (15.56) is of order r~? as r > œ. This will make the 
integral in H (15.55) divergent. Hence with scalar fields alone there are no 
topologically stable finite-energy solutions in four dimensions (Derrick 
1964). 

It turns out that this difficulty can be circumvented by adding gauge fields 
to the theory, i.e. by making the symmetry G a local symmetry. In this case 
the gradient term V;ġ is replaced by the covariant derivative 


D; = Vid + ig(A;-T)?. (15.57) 


It is then possible (through subtle cancellation) to have D,@ decrease like r~? 
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while A? and V,¢ both decrease like r~! such that the energy integral is 
convergent and the solution has non-trivial topological properties. Note 
that, for such time-independent solutions, the gauge field 4? decreases as r~t; 


the field strength will decrease as r~* which will correspond to a long-range 
magnetic field. (Cf. eqn (15.25).) 


15.3 The ‘t Hooft—Polyakov monopole 


As we mentioned in the last section, it is possible to find topologically non- 
trivial, finite-energy solutions in gauge theory with scalar fields. Since the 
electromagnetic interaction is described by an unbroken U(1) gauge 
symmetry and the finite-energy solution requires spontaneous symmetry 
breaking (the degenerate vacuum), we are led naturally to non-Abelian gauge 
theory in which the electromagnetic U(1) symmetry is embedded. Then the 
monopole will come out as a topologically non-trivial finite-energy solution. 
In this section, we will describe such a solution discovered by ’t Hooft (1974) 
and Polyakov (1974), which has the properties of the magnetic monopole. 
The simplest example is the SO(3) model due to Georgi and Glashow 
(1972a). Even though this model is ruled out experimentally by the discovery 
of the neutral-current phenomena (see §11.2), it is the simplest example of a 
non-Abelian gauge theory having monopole solutions. And by studying the 
embedding of this SO(3) > SO(2) [i.e. SU(2) > U(1)] solution, one can also 
obtain monopoles occurring in theories of larger gauge groups—as we shall 
do so for the SU(5) grand unification model. Thus we concentrate first on 
this fundamental SO(3) case. 


Soliton solution in the SO(3) model 


The Georgi—Glashow model is based on the SU(2) gauge group with a 
triplet of Higgs scalars ọ (for the basic monopole solution we shall ignore 
the fermion fields). The Lagrangian density is 


L = — GFE Fay + 7(D"d) (D.o) — Vio) (15.58) 
where 
Fiy = 0,A% — 8,45 — ee™ A? AS (15.59) 
(D, o) = 0,67 — ee™ AR" (15.60) 
À 

Vid) = 7p- a). (15.61) 

The equations of motion are 
(DF )a = — eea P D"A). (15.62a) 
(D*D,®), = — AQP O = a’). (15.62b) 


In this model, the values of @ which minimize the potential V(@) in eqn 
(15.61) are given by 
My = {6 = 0397 = a°}, (15.63) 
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i.e. M, consists of points on a sphere in a three-dimensional internal 
symmetry space, and all points in Mọ are equivalent to each other by SO(3) 
transformations. For convenience, we choose 


= (0, 0, a) (15.64) 


to be the ground-state configuration. The pattern of the spontaneous 
symmetry breaking is as follows (see discussion in §8.3) 


SU(2) ~ SO(3) > SO(2) = U(1) 


because ® given in eqn (15.64) is still invariant under the rotation around the 
3-axis (SO(2) transformation). We will identify the unbroken U(1) sym- 
metry as the electromagnetic interaction and the corresponding massless 
gauge bosons as the photon, A? = A,. The electric and magnetic fields are 
then given by 
FẸ = E', Fł = clEBE (15.65) 
To get the finite-energy solution we require that as r— oo, o(r) 
approaches values in M,. Since the spatial infinities also form a two-sphere 
S? (see eqn (15.52)) which has the same topology as the set Mọ, we can map 
each point in S° to the corresponding point in S? of Mọ to get a non-trivial 
topology 
Oo” =n; = af; (15.66) 


It is not hard to see that this mapping cannot be deformed continuously into 
the mapping for the vacuum configuration where the whole S? is mapped to 
a point given in eqn (15.64). Thus, the mapping given in (15.66) is 
topologically stable. (For further discussion on mappings with nontrivial 
topology such as St? > St and S? > S? see §16.1.) 

Given (15.66) ¢, oc f,, the finite energy requirement that D,@, = 0 up to 
order r7? (see discussion at the end of the last section) implies the gauge field 
asymptotic form of A? o ¢,,,°;. Believing that the lowest energy solutions 
correspond to those with the maximal symmetry, we can make the following 
ansatz for the explicit solution 


b j 
bs = H(aer), Ai = —&y, [1 —K(aer)], 42 =0 
1 1 (15.67) 


where H and K are dimensionless functions to be determined by the equation 
of motion. A pictorial comparison of the @ field configuration for the 
vacuum (15.64) and the monopole (15.67) is given in Fig. 15.5. For the ansatz 
given in eqn (15.67), which is time-independent, the energy of the system is 
given by 


4ra a dK\* 1/ dH = f 
p= Ele) re-a) + 5 (K* -1 
(0) 


+ K?H? + 5 (H? — ey | (15.68) 
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Fic. 15.5. @ configurations (a) for the vacuum; (b) for the monopole. 


where č = aer. The conditions for E to be stationary with respect to the 
variations of H and K are 


2 d*K 2 2 
a Jë ~ KH? + K(K? — 1) (15.69a) 
2 
4 £ a = 2K°H+ 4 HEP — &), (15.69b) 


These equations of motion for H and K can also be obtained from 
substituting the ansatz (15.67) into the equation of motion given in (15.62). 
The asymptotic condition (15.66) will imply that 


H(é)~E as E> œ. (15.70a) 
Then in order to have a convergent integral in eqn (15.68), we require 
K(é) ~0 as E> œ (15.70b) 
and 
H<oO(é, K(é-1<0O(@ as €-0. (15.70c) 


It turns out that solutions to eqn (15.69) with boundary conditions (15.70) do 
exist and the functions H and K have the shapes shown in Fig. 15.6. The total 
energy of the solution, which will be interpreted as the classical mass, can be 
obtained from eqn (15.68) 


Mass = TA fale) (15.71) 


where f(A/e’) is the value of the integral in (15.68) and has been calculated 
numerically to be of order unity for a wide range of values of A/e*. Thus the 
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scale for the mass of this classical solution is set by the parameter a, which is 
the vacuum expectation value of the scalar field and is also the scale for the 
spontaneous symmetry breaking SO(3) > SO(2). 


The ’t Hooft—Polyakov soliton as a magnetic monopole 


From the asymptotic condition (15.70b), we see that at large distances 


s lo., W 
FÌ ~ — rr, ~ clerk 
k 

oer aer? 4 


which implies that the magnetic field in eqn (15.65) at large distance is 
B ~ — -—: (15.72) 


Comparing this with eqn (15.16), we see that this field is due to a monopole 
which has a magnetic charge 


g = —4n/e. (15.73) 


The constant e in (15.73) is the electromagnetic coupling constant which in 
this simple model is related to the electric charge operator by 


Q = eT, (15.74) 


where 7; is the third component of the weak isospin operators, which are the 
generators of SO(3) gauge symmetry. Since the smallest possible non-zero 
electric charge which might enter the theory is qọ = e/2 corresponding to 
T, = 4, we see that eqn (15.73) gives 


Tg =, (15.75) 


Thus the magnetic charge g of the monopole takes its lowest value when 
compared to the Dirac condition (15.11). This classical finite energy solution 
which is topologically non-trivial is called the °t Hooft-Polyakov monopole. 
Note that the quantization of the electric charge given in (15.74) is a 
consequence of the fact that the electromagnetic U(1) symmetry is embedded 
in the simple non-Abelian gauge group, in this case SO(3), and is independent 
of the existence of the monopole solution. Rather, here one discovers that 
both magnetic monopole and charge quantization are consequences of 
spontaneous symmetry breaking of a non-Abelian simple group down to the 
electromagnetic U(1). 

This ’t Hooft—Polyakov monopole differs from the Dirac monopole in two 
important aspects; the °t Hooft-Polyakov monopole has a finite core while 
the Dirac monopole is a point and there is no need for the Dirac string in the 
t Hooft-Polyakov monopole. The finite size of the °t Hooft—Polyakov 
monopole core comes from the fact that for large € eqn (15.69a,b) becomes 

d*K d*h 2A 


de de? 
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where H = h + é. Thus for large č, we have 


K~ec2ze™ 


H — ë ~ eTM we 


where u = (2A)?a and M ~ ea are the masses of the scalar and gauge bosons, 
respectively. This implies that the approach to the asymptotic form of each 
field is controlled by the masses of the corresponding particle. Hence we can 
think of the °t Hooft-Polyakov monopole as having a definite size 
determined by these masses. For distances smaller than this size, the massive 
fields play a role in providing a smooth structure and, for distances larger 
than this size, they vanish rapidly to give a field configuration indis- 
tinguishable from that of a Dirac monopole. From this one can also have a 
simple understanding of its mass value as given in (15.71). We divide the 
contribution to energy in (15.68) into two parts, coming from fields inside 
and outside the core respectively. Outside the core, D,@ = 0 and the electric 
field E = 0, only the magnetic field survives: 


l z l 14 
[ene -3(2) | dn? dr = 5m. 


For the stationary solution the core contribution should be comparable, 
yielding E = O(M/a). Thus the monopole is heavy because it has a small 
core and the Coulombic magnetic energy diverges as r > 0. 

As for the Dirac string, it is replaced in the °t Hooft-Polyakov monopole 
by the scalar field. To see this, write the asymptotic solution as 


i aij "j 
At =e F Pi = gT (15.76) 


We can write A! in eqn (15.76) as 
; l ; 
Ai = abc 4b ĝi c ; 
a= a; i? ab (15.77) 


and the magnetic field tensor at large distance is given by 


Fi = Ai — ðA, — e( Ai AG — ALAL) 
re a | | | 
= GAL — dA; + — $: (p x d). (15.78) 


Thus the magnetic field tensor has the form 
Fi = ðA — ðA’ + (extra term). 


In the Dirac monopole the extra term is singular and involves the Dirac 
string, while in the °t Hooft—-Polyakov monopole the extra term is smooth 
and involves the scalar fields. Thus, in some sense, in the °t Hooft—Polyakov 
monopole the singular Dirac string has been ‘smoothed’ into a scalar field. 

To summarize, in the SO(3) model where the non-Abelian symmetry is 
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spontaneously broken down to the electromagnetic U(1) symmetry, there 
exists a topologically non-trivial finite-energy solution, the °t Hooft- 
Polyakov monopole, with the following features. 


(1) It has the same behaviour as the Dirac monopole at large distance; 

(2) It has a finite core size determined by the masses of the gauge boson or 
scalar particle; 

(3) The classical mass of the monopole is of order of the spontaneous 
symmetry-breaking scale-the vacuum expectation value of the scalar field; 

(4) There is no need for the Dirac string. 


Coupling of the spatial and internal symmetries 


Finally let us take note of an important feature of the monopole in the 
non-Abelian gauge theory with spontaneous symmetry breaking: it mixes the 
spatial and the internal symmetries. In fact the °t Hooft—Polyakov ansatz 
(15.67) has the feature that it is symmetric with respect to rotations 
generated by 


x as es | (15.79) 


where L is the spatial angular momentum (including the ‘ordinary’ spin) and 
T is the internal symmetry ‘isospin’ generator. Thus for example the form of 
@:Tocr-T in (15.67) shows clearly that it is invariant under J. As a 
consistency check, we shall write out (15.79) more explicitly. The canonical 
momentum being p; = mv; + e(A;: T) with the gauge field outside the core 
being given by (15.76) we have 


p=mi+8xT (15.80) 
and 
J=rxp+T 
=rxmr+rxrx T+T 
=r x mi + (f TY. (15.81) 
Comparing to (15.18): 
Jar x mi -i Ot (15.82) 


and noting that the magnetic charge of the °t Hooft-Polyakov monopole is 
given by (15.73), we have the consistency condition 


Q = ef-T. (15.83) 


This is indeed the correct identification because outside the core we have 
g =f and the unbroken electromagnetic U(1) symmetry corresponds to 
those SU(2) rotations that leave @ invariant. For example, if f is in the z- 
direction, @ will be in the 3-direction in the SO(3) space and f: T = T; is the 
generator for rotation around the 3-axis which will leave invariant. 
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This ‘coupling’ of the spatial and internal symmetries has some surprising 
consequences. For example, consider scalar particles (@*, @~ ) transforming 
as a doublet under T: with Q = eT; we have 


Op* = +3e¢*. 


Equation (15.79) informs us that the combined system of a monopole and 
this isodoublet can have half-interger angular momentum even though no 
fundamental fermion field has been introduced. This phenomenon of ‘spin 
from isospin’ has been discussed by Jackiw and Rebbi (1976), and by 
Hasenfratz and ’t Hooft (1976). That this does not violate the usual 
connection between spin and statistics has been shown by Goldhaber (1976). 

By the same reasoning we can see that the Dirac equation in the back- 
ground monopole field A,(x) can have integral J partial wave solutions. 
In this connection there is another surprising result: fermions are found to 
change their nature when scattering off a monopole in the S-wave. As the 
angular momentum in (15.79) and (15.82) acquires a sign change when the 
fermion passes through the monopole core: f > —f, J can be conserved only 
if (i) the charge, or (ii) the helicity of the fermion, make the corresponding 
change. This ‘paradox’ can be resolved only when one invokes results of the 
quantum monopole theory. (1) For the charge change, one discovers that a 
quantized monopole possesses a tower of excited states which have the 
charges of the gauge bosons coupling to the upper and lower members of an 
isodoublet. Such electrically and magnetically charged states are termed 
dyons. Thus overall charge conservation can be maintained in the fermion— 
monopole scattering as monopole turns into a dyon, 


M+ > D+H. (15.84) 


(2) For the helicity change, this is possible in the presence of monopole fields 
because we have a nonvanishing axial anomaly: F-F = E -B #0 and the 
possibility of chirality nonconservation (see §6.2), and for the massless 
fermions chirality and helicity changes are correlated. 


Grand unified monopoles 


The SU(2) > U(1) °t Hooft-Polyakov monopole solution may be re- 
garded as the fundamental pattern when we consider the monopole solutions 
to larger gauge groups. Topological considerations lead to the general result 
that stable monopole solutions occur for any gauge theories in which a simple 
gauge group G is broken down to a smaller group H = h x U(1) containing 
an explicit U(1) factor. For a review of the topological arguments see 
(Coleman 1975, 1981). Clearly this is compatible with our expectation that 
charge quantization and existence of monopole are related and that charge 
quantization follows from the spontaneous symmetry breaking of a simple 
gauge group. Thus in the grand unified theories where the symmetry is 
broken from some large simple group, e.g. SU(5), to SU(3), x SU(1)em, there 
are also monopole solutions of the °t Hooft—Polyakov type. The monopole 
mass is determined by the mass scale for the symmetry breaking, Mx, and is 
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of order M,/e?. In the SU(5) model, we have Myx z 1014 GeV which implies a 
very heavy monopole mass ~10'® GeV. This means that this type of 
monopole is out of reach for its production by accelerators. But it could be 
relevant for physics in the extreme early universe (see Preskill 1979; Guth and 
Tye 1980). In fact the attempts to suppress the monopole abundance in the 
conventional cosmology led originally to the ‘inflationary universe scenario’ 
(Guth 1981) and its subsequent refinement (Linde 1982; Albrecht and 
Steinhardt 1982; Hawking and Moss 1982), which has the promise of 
solving several fundamental problems in cosmology. 

It should be noted that the stable grand unified monopole having the 
smallest magnetic charge (hence the smallest mass) is expected to have both 
the ‘ordinary’ and the colour magnetic charges. (The colour magnetic fields 
are then supposed to be screened by the gluons.) Namely the SU(2) > U(1) 
embedding in G > SU(3), x U(1),, is such that the final ‘magnetic U(1)’ 
factor sits in both U(1),,, and SU(3),. To see this more explicitly consider the 
following SU(2) embedding in the SU(5) group of Chapter 14: 


T= ; (15.85) 
0 


where t are the Pauli matrices. Thus they act both in the colour SU(3) (the 
first three components) and the electroweak SU(2) sectors. (The alternatives 
with the third colour replaced by either the first or the second colour are 
equally possible.) It has been shown that this embedding (15.85) leads to the 
smallest magnetic charges (Dokos and Tomaras 1980). Given the electro- 
magnetic charge matrix of (14.12) in SU(5) model, the form of the 
monopole Higgs field @ ~ f suggests that the charge matrix can be written in 
the following spherically symmetric form 


(15.86) 


which agrees with (14.12) when f = z. The monopole vector potential is then 
given as in (15.76): 


l 


Af = — € 
gsr 


l 


î (15.87) 


aij j 


where g, is the SU(5) gauge coupling constant and is related to the 
electromagnetic coupling e by (14.50) and the QCD SU(3), coupling g, by 
(14.47): 


e = VIs, Is = 93. (15.88) 
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Equation (15.87) yields a matrix-valued magnetic field as 


A -T 
B; = BiT? = 46,,F%,T" = oe fy. (15.89) 
5 


One can project out the electromagnetic and colour components by taking 
the trace of the product of this magnetic field matrix B with the generator 
matrices of the U(1).,, and SU(3), gauge groups. For the normalization 
factors one should keep in mind the relations in (15.88) and the fact that for 
the fundamental representation tr(T?7°) = 46". Thus (15.89) has the 
electromagnetic component 


3 i, 


corresponding to a magnetic charge of one Dirac unit g = 27/e. Similarly the 
colour magnetic fields are calculated to be 


1 f; 
= A) = — 6% — = O 15.91 
Bi = 2 tr(B;A*) VE ô m a ; ( ) 
Clearly (15.90) and (15.91) show that the ‘magnetic charge matrix’ has two 
components, represented in the unitary gauge as 


T; = —3(Q + Q.) (15.92) 


Q, being the colour hypercharge /,/./3. 

One can see this need for colour magnetic field more ‘physically’ by 
considering an Aharonov-Bohm experiment for a fractionally charged 
quark, say d, (i.e. a down-quark with the first colour). The null-effect 
condition is satisfied because of a cancellation due to the presence of the extra 
phase factor coming from the quark colour charge q, multiplying the colour 
magnetic flux of the monopole 


exp} is $ dl - Aom + ipa AS =e '2n/3 ei2n/3 _ | 


where we have used e = 27/g, etc. 


Monopole catalysed proton decay. With respect to the SU(2) embedding 
(15.86) the fifteen left-handed members of the first SU(5) fermion family 
(14.7) and (14.9) have the following transformation properties: 


+ c c c 
doublets: (' ($) (a) = 
dg/i \e7/i W/L W/L 


singlets: vi, diL, dzL, Uzt, diL, dai, U3L. (15.93) 


With these doublets it is possible to have baryon-number-changing reactions 
such as 


M +u +u, +d; > M +e. (15.94) 
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This may be regarded as the inverse process of (15.84) with subsequent ‘dyon 
decay’. Normally one would expect that such transitions be strongly 
suppressed by the super-heavy mass factor My. However studies of the 
fermion—monopole dynamics by Rubakov (1982), Callan (1982), and 
Wilczek (1982a,b) suggest that the fermion—monopole vacuum may be highly 
degenerate and condensates such as (u,u,d3e Jo exist in regions of space 
where the magnetic and chromomagnetic fields of the grand unified 
monopole coexist. We expect the colour magnetic field to extend as far as the 
confinement radius of order 1 fermi. Thus it is suggested that the baryon- 
number-nonconserving interaction (15.94) may have a typical strong interac- 
tion cross-section. This will manifest as proton decays strongly catalysed by 
monopoles. Although the theoretical understanding of this mechanism is still 
under investigation, it is clear that quantum theory of monopole—particle 
interactions will be highly nontrivial and should contain much interesting 
physics. 


16 Instantons 


In the previous chapter we encountered particle-like solutions of Yang—Mills 
field theory, the monopoles; they correspond to fields with nontrivial 
topological properties in ordinary three-dimensional space. We now study a 
class of solutions with topological structure in the Euclidean four- 
dimensional space-time, the instanton solutions. Like the soliton solutions of 
Chapter 15, they have finite spatial extension—thus the ‘-on’ in its name— 
and unlike solitons, they are also structures in time (albeit imaginary time)— 
thus the ‘instant-’. For the same reason, they are also called ‘pseudo particles’ 
in the literature. 

Again, as in Chapter 15, the presentation will be given at an elementary 
level. The organization of this chapter is as follows. In §16.1, after a brief 
introduction to the topological notion of homotopy we show how the 
instanton solution can be obtained in the Euclidean Yang—Mills theory. The 
interpretation of instantons as tunnelling events between vacuum states with 
different topological quantum numbers is presented in §16.2. It shows that 
the vacuum of non-Abelian gauge theory in general, and QCD in particular, 
is not unique. A Yang-Mills Lagrangian actually represents a continuum of 
theories labelled by a parameter 6—just as each value of coupling constant 
describes a different theory. In §16.3 massless fermions are incorporated into 
the theory; their presence suppresses the vacuum tunnelling. One then sees 
how the nontrivial structure of QCD vacuum as revealed by the instanton 
solutions can help us solve the famous axial U(1) problem present in any 
quark theory with chiral symmetry. On the other hand the instantons 
themselves bring about strong P and CP violations. Possible ways out of this 
difficulty are briefly mentioned. 

Throughout this chapter, especially in the first two sections, we work with 
the SU(2) Yang—Mills theory. For applications we consider mostly those for 
QCD which is SU(3). However all results are understood to be valid for the 
SU(2) subgroup of SU(3), or any other non-Abelian groups of higher rank. 


16.1 The topology of gauge transformations 


In Chapter 15 we encountered fields with nontrivial topology. They 
correspond to mappings from S° to S?, where S° is a sphere in three- 
dimensional space and is a two-dimensional manifold, i.e. a two-sphere. In 
this chapter we need to consider mappings S? — S°, where S° is a sphere in 
four-dimensional Euclidean space, i.e. a three-sphere. 
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Homotopic classes 


To study the topological properties of continuous functions, one can divide 
them into homotopic classes; each class is made up of functions that can be 
deformed continuously into each other. More precisely, let X and Y be two 
topological spaces and p(x), f(x) two continuous functions from X to Y. Let 
I denote the unit interval on the real line 0 < t < 1; fo and f, are said to be 
homotopic if and only if there is a continuous function F(x, t) which maps the 
direct product of X and J to Y such that F(x, 0) = p(x) and F(x, 1) = f(x). 
The continuous function F(x,t) which deforms the function f(x) con- 
tinuously into f(x) is called the homotopy. We can then divide all functions 
from X to Y into homotopic classes such that two functions are in the same 
class if they are homotopic. 

To illustrate the notion of homotopic classes, we consider the following 
examples. 


St + S’. Let X be the points on a unit circle labelled by the angle {6}, with 
6 and 0 + 2z identified, and let Y be a set of unimodular complex numbers 
u, = {e"}, which is topologically equivalent to a unit circle, a ‘one- 
dimensional sphere’. We consider the mapping {6} > {e}. The continuous 
functions given by 


S(O) = expfi(n8 + a)] (16.1) 


form a homotopic class for different values of a and a fixed integer n. This is 
because we can construct a homotopy 


F(0, t) = exp{i[n0 + (1 — 165 + 16,]} (16.2) 
such that 
Jo(®) = expli(n8 + 8o)] 


fF, (0) = exp[i(né + 0,)] (16.3) 


are homotopic. One can visualize f(@) of (16.1) as a mapping of a circle on to 
another circle. In this mapping, n points of the first circle are mapped into 
one point of the second circle and we can think of this as ‘winding around it n 
times’. Thus, each homotopic class is characterized by the winding number n, 
also called the Pontryargin index. From (16.1), the winding number n for a 
given mapping f(0) can be written 


and 


dô —i df (0) 
= | Sela T | oy 
0 


Of particular interest is the mapping with the lowest nontrivial winding 
number, n = 1, 


SO Se". (16.5) 


By taking powers of this mapping, we can get mappings of higher winding 
numbers. For instance, the mapping [ f"(@)]” will have winding number m. 
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We can also write the mapping in (16.5) in the Cartesian coordinate system as 
f(x y)=xtiy with x+y =1. (16.6) 


We can generalize the domain_X of this mapping from the unit circle to the 
whole real line — œ < x < œ, by identifying the end-points x = œ and 
x = — œ to be the same point, i.e. the mappings are required to satisfy the 
property f(x = œ) = f(x = — oo). Clearly this has the same topology as the 
unit circle. Examples of this type of mapping with winding number n = 1 are 

f(x) = exp{inx/(x? + *)*} (16.7) 
(A + ix)? 
A? + x? 
where å is an arbitrary number. In this case the topological winding number 
for a general mapping can be expressed as 


1 T —i df (x) 


which yields n = 1 for the functions f(x) and f(x) given in (16.7) and (16.8). 


f(x) = exp{i2 sin™ *[x/(x? + A*)#]} = (16.8) 


S? — S°. We now consider mappings from a three-sphere to SU(2) space, 
i.e. mappings from the points on S°, the sphere in four-dimensional 
Euclidean space labelled by three angles, to the elements of the SU(2) group, 
which are also characterized by three parameters. More explicitly, the 
manifold of the SU(2) group elements is topologically equivalent to a three- 
sphere S°. This is because, any element in the SU(2) group can be written in 
terms of the Pauli matrices as U = exp{ie-t}. From the identities of the Pauli 
matrices, we can write U = uy + iu:t with real uy and u satisfying 


ug + u? = 1 


which follows from UUt = U'U = 1. This is clearly the equation for the 
sphere in four-dimensional Euclidean space, S°. Mappings in this case are 
also characterized by the topological winding number n and the generaliza- 
tion of (16.6) with n = 1 is 


f(Xo.X) = Xo + ix't with x3+x?=1. (16.12) 


It can be shown that the topological winding number n can be expressed as 
(see, for example, Coleman 1977) 


—| 


Yan? 


where 
A; =f ~*(xo, x) Of (Xo, X) (16.13) 


and 6,, 02, and 0, are the three angles that parametrize S°. 
One can also generalize to the case where the domain X is the whole three- 
dimensional space with all points at infinity identified. Examples of the 
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mappings with n = | are 
f(x) = expfinx -t/(x? + 17)*} (16.14) 
f(x) = (At + ix)?/(x? + 17) (16.15) 


which are generalizations of the mappings given in (16.7) and (16.8). As for 
the winding number in (16.9), we now have the volume integral 


— | 
n= 74r? [ers tr(é;;,A;A ; Ay) 
y 
Ae O: iS) 


As we shall see, SU(2) transformations of the form (16.12), (16.14), or (16.15) 
are very much of physical interest. 


The instanton solution to Euclidean gauge theory 


Here one seeks the (finite-action) solution to classical Yang-Mills theory in 
Euclidean space (x? = x2 + x”). The SU(2) gauge fields 


A, = At Fay = 5 Fa, j=l 23 
have the Lagrangian 
l 
et a (16.17) 


2g? 


where for notational convenience we have scaled the gauge fields as 


with 
Fiv = 0,A, — 0,A, + [4a Ay]. (16.18) 
Under a gauge transformation U, we now have 
A, = U-'A,U+U~* 0,U. (16.19) 
We require the solution to satisfy the boundary condition that the Lagrange 


density vanishes, i.e. F,,F,, = 0, at infinity so that the Euclidean action, 


l 
Se = [arz tr Fy Fay), 


is finite. This means that in Euclidean space 


F(x) — 0. (16.20) 


Normally we take this to mean 


A,(x) —> 0. (16.21) 
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From the viewpoint of gauge transformations (16.19), this is much too 
restrictive. Condition (16.20) only requires 4, to approach the configuration 


A,(x) — U~*4,U (16.22) 


which is obtained from A,(x) = 0 by a gauge transformation (16.19) and thus 
also yields (16.20). We say such a A,(x) field is a ‘pure gauge’. One notes that 
points at infinity (|x| > œo) in four-dimensional Euclidean space are three- 
spheres and the gauge transformation U in (16.22) represents mappings from 
S? to SU(2) space. Thus the Us are just the S? > S? type of functions 
discussed in connection with homotopic classes, with some topological 
winding number. The instanton solution discovered by Belavin, Polyakov, 
Schwartz, and Tyupkin (1975) corresponds to Us with a nontrivial winding 
number, i.e. n = 1 (with n > 1 solutions for instantons far apart in space- 
time obtainable by multiplication). 

Let us first express the winding number in terms of the gauge fields. For 
this purpose we introduce an (unobservable) gauge-dependent current 


K, = 4€,y,, tt[A, 0,4, + $A,A,A,]. (16.23) 
It is straightforward to check that 
6,K, = 2tr(F,,F,,y) (16.24) 
where 
en oe (16.25) 


is the dual of F,,. Consider the volume integral 
[ats tr(F,,F,,) = 4 [ats K =% [aok (16.26) 
S 


where the surface integral is over the S° at infinity. In this region A, is given 
by (16.22) and using the antisymmetry properties of the indices and U'U = 1, 
we have 


K, = 4 ya) tt[(Ut 8,U)(Ut 8,U)(Ut 8,U)]. (16.27) 


u 


Substituting (16.27) into (16.26) and comparing it to the expression for the 
S? — S? winding number in (16.16) we obtain 


l $ 
aes [ats tr(F Fav). (16.28) 


To find A,(x) satisfying the boundary condition (16.22) on the Euclidean 
three-sphere at infinity, we make use of the important positivity condition in 
Euclidean space 


tr | (Fe + F,,)? d*x > 0. (16.29) 
Since 
alia): S20 ie aks): (16.30) 
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we have the inequality 


= 16n?n (16.31) 


uv* uv 


tr | Bek d*x > |tr |z F, d*x 
where we have used (16.28). Thus, the Euclidean action satisfies the 
inequality 


g 2 
S:(4) > oe (16.32) 


This implies from (16.29) that the action is minimized (i.e. equality achieved) 
when 


(16.33) 


i.e. the self-dual or antiself-dual fields are the (finite-action) solutions to the 
classical Euclidean Yang—Mills theory. We remark that the usual solution 
A, = 0 which has trivial topological quantum number (i.e. n = 0) clearly 
satisfies condition (16.33). 

To find nontrivial self-dual gauge-field solutions, Belavin et al. (1975) 
employed the strategy of first considering F,, of O(4) gauge theory, which is 
isomorphic to SU(2) x SU(2): one can identify one SU(2) with the SU(2) of 
the internal symmetry and the other SU(2) with the three-sphere at space- 
time infinity. Also in O(4), since F,„ is really a matrix with four indices 
F%s, where («, 8) are the internal O(4) space indices and (u,v) are the 
Euclidean O(4) indices, the self-dual condition of (16.33) can be translated 
into simple symmetry conditions on these charge and space-time indices. 
From this it is possible to construct the solution explicitly. We shall not 
provide the details here as the SU(2) gauge transformation has just the form 
discussed in eqn (16.12) 


U(x) = ra (16.34) 


where p° = xo + x°. This gives rise to a gauge field 
2 


A, (x) = (o ôU (16.35) 


where å is some arbitrary scale parameter, often referred to as instanton size. 
For p > A, we have 


A(x) + U~* 6,U (16.36) 
as required by the boundary condition. More explicitly we can write 


—1T°x —i(TXy + T X x) 
A De ee Att) 4-2 16.37 
o(X) p? 4 2 (x) p? re 2 ( ) 
One can further check that the corresponding action integral indeed has the 
value 827/g?. 
In the next section we shall discuss the physical interpretation of these 
nontrivial minima of the Euclidean action. 
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16.2 The instanton and vacuum tunnelling 


The physical interpretation of the instanton solutions as quantum- 
mechanical events corresponding to tunnelling between vacuum states of 
different topological numbers was first advanced by ’t Hooft (1976), and later 
substantiated in the work of Callan, Dashen, and Gross (1976) and of Jackiw 
and Rebbi (1976). 

In Feynman path-integral formalism the basic vacuum-to-vacuum transi- 
tion amplitude is expressed as a sum (a functional integral) over all possible 
paths between the initial and final states, weighted by the exponential of 1 
times the action for the particular path. In previous discussions we have 
included in our sum of path history only the condition A,(x) > 0 on the 
boundary. We shall see that non-Abelian gauge-field theory has a vacuum 
with an unexpectedly rich structure. It corresponds to a superposition of 
vacuum states with different topological winding numbers. The instanton 
field configurations correspond to paths that connect initial and final vacuum 
states with different winding numbers. We shall examine the effect of 
including all such field configurations in our path-integral formalism. 


Multiple vacuum states 


To be more specific let us place our system inside a box. The vacuum 
condition 


f= (16.38) 
is obtained for the region 
t< —T/2, t>T7/2, and |x|>R (16.39) 


with T and R both very large. The 2 + 1 space-time version is depicted in 
Fig. 16.1. One then sums over all paths A,(x) that are consistent with 
the boundary condition that the vacuum state (16.38) be maintained outside 
the cylinder. 


Fic. 16.1. 


Throughout this section we shall be working with the gauge-fixing 
condition 


Aj(x)=0 forall x. (16.40) 


But we are still left with the freedom of time-independent gauge 
transformations 


dyU(x) = 0 (16.41) 
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because 
A(x) > Ao(x) = U~*(x)Ao(x) U(x) + U(x) ð U(x) = 0 


where we have used both (16.40) and (16.41). Thus the vacuum will be 
described by a time-independent potential A,(x) which is a pure gauge 
potential 


Ax) = A(x) = U~ +(x) 6,U(x). (16.42) 
At initial time t = — 7/2, we can use this remaining gauge freedom to pick, 
for example, 
A(x) = 0 (16.43) 
by choosing 
U(x) = 1. (16.44) 
Then the vacuum state condition (16.38) implies that 
Fo; = 004; = Q. (16.45) 


Thus (16.43) is maintained throughout the vacuum, 1.e. throughout the space 
outside the cylinder in Fig. 16.1. In particular on the top-surface of the box 
(t = 7/2) we have the situation of a disc with its edge identified, i.e. all points 
on the edge have the mapping (16.44). In realistic 3+ 1 space-time 
dimensions, this corresponds to the t = 7/2 vacuum being a pure gauge of 
mappings to the SU(2) gauge group manifold from three-dimensional space 
with infinities identified. As we have discussed in the previous section such 
gauge transformations, and hence the corresponding vacuum states (16.42), 
can be divided into inequivalent homotopic classes. An example of (16.42) 
with 


U(x) = expfint:x/(x? + A7)?} (16.46) 


as given by eqn (16.14) is ann = 1 vacuum state. Thus we conclude that there 
is a multiplicity of vacuum states |”), each characterized by its topological 
winding number. 


The formal relation between multiple vacua and the instanton solution 


The instanton solution, as we have shown in §16.1, is itself characterized by a 
winding number v. We shall show that such a path in Euclidean space 
connects vacuum states that differ by that winding number 


A(x, Xo = — 0) = A,(x) of vacuum state |n> 
A(x, Xo = 00) = A,(x) of vacuum state |n + vò. (16.47) 


To see this formally let us recall the basic v = | instanton solution as given 
by eqns (16.34), (16.35), and (16.37). To cast it in the form of the Ay = 0 
gauge of (16.40) we make a gauge transformation on A,(x) of (16.37) 


A (x) > AX) = VAOVA + V(x) AV). (16.48) 
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The condition A¢(x) = 0 implies that 


L V(x) = —Ao(x)V(x) 
OXo 
ix ‘°T 
zae a (16.49) 


where we have written out the instanton solution (16.37). Eqn (16.49) can be 
integrated to yield 


ix'T 


V(x) = OP ore (tan a ae + bo) (16.50) 


We can set the integration constant to be 
Oo =(n+4)n. (16.51) 


The result in (16.49) is obtained by requiring the boundary condition on the 
time-component A,(x) > Ao(x) = 0. If we take the spatial component A,(x) 
to be zero at x» = +o, then 


A; = V Hx) ô V(x) (16.52) 
with 


V(Xo = — oo) = exp in a ra | 
and 


V(xo = oo) = epia oe (n af Df (16.53) 


Thus the instanton solution of (16.37) indeed connects two vacuum states [cf. 
eqn (16.46)] that differ by one unit of winding number. This suggests that for 
a path-integral representation of the vacuum-to-vacuum transition ampli- 
tude, eqn (1.67) should be generalized to 


<nle~'""|m>, = [84 cam exp = ie + JA) atx (16.54) 


where | and |m> denote vacua with winding numbers n and m, respectively. 
Thus, we should sum over all gauge fields belonging to the same homotopic 
class with winding number v = n — m. 


The instanton as a semi-classical tunnelling amplitude 


The fact that the instanton is the minimum of the Euclidean action, i.e. it is 
the classical path for imaginary time, reminds one of the semi-classical 
(WKB) barrier-penetration amplitudes in non-relativistic quantum 
mechanics. This indeed turns out to be a fruitful analogy and we shall 
present a brief review of the elementary quantum-mechanical example of 
tunnelling between the two ground states of the double-well potential in 
Fig. 16.2(a) 


V(q) = (P — 40)" (16.55) 
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V(q) 


Fic. 16.2. 


where q(t) is some generalized coordinate. The energy of the system is then 


_ 1 fdq\ 
E=- (4) + VQ). (16.56) 


The classical ground state for this system is at q = qọ Or q = — qo with E = 0. 
In classical mechanics clearly there is no E = 0 path leading from gy to — qo. 
However there can be quantum-mechanical tunnelling, so that the true 
ground state is neither |qọ> nor |— qo) but their superposition 


2 


The quantum-mechanical tunnelling amplitude can be calculated using the 
classical particle trajectory in the imaginary time system. This is because in 
the imaginary time system where t = —it and (dg/dt)* = —(dq/dt)”, the 
energy of the system is given by 


1 (dq V 
ape $ ($) 2 Ma) (16.56a) 


which is equivalent to a particle moving in the potential — V(qg) shown in Fig. 
16.2(b). Thus in imaginary time there will be a path going between — qo and 
qo With E = 0. Setting E = 0 in eqn (16.56a) we can solve for this trajectory, 


g(t) = qo tanh (2767). (16.58) 


iground > = —; (lqo> + |—4o>). (16.57) 


The action for this trajectory in the imaginary-time system is finite and can be 
calculated 


+ 0 5 + 00 
S, = | ec} ($4) = -vo = 2 | dtV(q) 
2\dt 
aa | (q(t) — qF dt = $ 2443. (16.59) 


To get the tunnelling amplitude, we will use the path-integral formalism, in 
which the transition amplitude is given by 


<q lgi> = [tan ae (16.60) 
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In imaginary time (or Euclidean space), this becomes 


Cail eta,» = [tan gene (16.61) 


where Sẹ = —iS is the Euclidean action. The right-hand side of eqn (16.61) 
can be visualized as the summation over all possible paths going from q; to q;. 
In the semi-classical approximation (expansion in powers of h), the integral 
in (16.61) will be dominated by those paths for which Sg is stationary. It is 
not hard to see that the tunnelling amplitude for the case we are considering 
here will be of the form 


T ~ eS” + O(h)] = exp(—4 V2 g3/3h). (16.62) 


This can be verified in this simple case by explicit calculations (for detail see 
Coleman 1977). 

This example and eqn (16.53) suggest the interpretation that the instanton 
configuration corresponds to tunnelling between different vacuum states: 
Iny — |n + 1>. When we generalize the transition amplitude in (16.61) to field 
theory, we will find that not all field configuration (paths) give finite action 
because there are infinite degrees of freedom in field theory. But in the semi- 
classical approximation a configuration of infinite action will give zero to the 
path integration weighted by e~ **. Thus the path integral will be dominated 
by configurations with finite action. Hence the tunnelling amplitude in the 
semi-classical approximation can be calculated in terms of the instanton 
configuration and has the form 


T ~ e™ãe œ~ exp(— 827/97) (16.63) 


where we have used eqn (16.32). The form of eqn (16.63) also shows clearly 
that it is an effect that cannot be seen in ordinary perturbation theory. 


-vacuum 


As vacuum states |n corresponding to different topological winding 
numbers are separated by finite-energy barriers and there are tunnellings 
between these states, we expect the true vacuum state to be a suitable 
superposition of these |n> states. We note that under a gauge transformation 
T, having a winding number 1 itself, we have 


T,|n> = |n+ 1> (16.64) 
and gauge invariance means that it commutes with the Hamiltonian 
[7,, H] = 0. (16.65) 


Thus we have a situation very similar to the familiar quantum-mechanical 
problem of periodic potential: with T being the translation operator and with 
the true ground state being the Bloch wave and there exists a conserved 
pseudomomentum. We can construct the true vacuum, the ‘0-vacuum’ as 


0> = ¥ ein) (16.66) 
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which is an eigenstate of the gauge transformation 
T,|05 = e|0). (16.67) 


Just like the pseudomomentum in the case of a periodic potential, 0 labels the 
physically inequivalent sectors of the theory and within each sector we may 
study the propagation of gauge-invariant disturbances. Since different 0- 
worlds do not communicate with each other, there is no a priori method of 
determining the value of 0. 

In terms of the 6-vacuum, the vacuum-to-vacuum transition amplitude in 
path-integral formalism should be 


6'le #19), = 6(0 — ODIO). (16.68) 


Writing the left-hand side in terms of vacuum states with definite winding 
numbers we have 


CO'e FOS , = > eid e7" m] en, 
m,n 


= pA e7 i0- m)® e" — 8) [ta41,-, ell (P+ JA)d4x (16.69) 


myn 


where we have substituted in the expression (16.54). We can cast (16.69) in 
the standard form by relabelling n — m —> v; after doing the m-summation 


1,(6)=ye™ fraa, exp| =i je + JA) ax | 


=) (raan, exp] =i [ea + JA) atx (16.70) 
and using eqn (16.29), we obtain 
0 Š 
La = £+ Te tr(F, FY”). (16.71) 


Thus the rich structure of gauge theory vacuum corresponding to tunnelling 
between states with different topological winding numbers gives rise to an 
effective Lagrangian term which violates P and CP conservation. As we have 
already mentioned in §10.2, such a term is normally discarded because the FF 
term can be expressed as the divergence of current (eqn (16.24)) and hence as 
a surface term in the action (eqn (16.26)). However because there is a 
nontrivial instanton gauge field configuration which does not vanish at 
infinity (eqn (16.22)), such an ‘abnormal’ term actually survives in non- 
Abelian gauge field theories. 


16.3 Instantons and the U(1) problem 


The U(1) problem 


The ideas of chiral symmetry and quark-gluon interaction as described by 
QCD seem to lead to a contradiction. For simplicity we will illustrate this 
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problem in the case where there are only two quark flavours in the theory, u 
and d quarks. In the limit m,a > 0, the QCD Lagrangian has the symmetry 
SU(2), x SU(2)p x U(1)y x U(1), which is larger than the chiral symmetry 
SU(2), x SU(2)p discussed in Chapter 5. The U(1)y symmetry generated by 
the transformation q; > eq, gives rise to the current 


J = ty,u + dy,d. 


This is just the baryon-number current for this two-flavour case and the 
U(1)y symmetry manifests itself in the baryon-number conservation. But the 
U(1), symmetry generated by the transformation q; — esq; with the current 
given by 


J? = Uy,ysu + dy,ysd (16.79) 


does not seem to correspond to any observed symmetry in the hadron 
spectra (e.g. we do not observe a parity doubling of the baryon states). 
Thus we expect this U(1), to be realized in the Goldstone mode and to give 
rise to an additional massless pseudoscalar besides the pion isotriplet which 
are the Goldstone bosons resulting from the spontaneous breakdown of the 
chiral SU(2), x SU(2)p symmetry. As we turn on the quark mass, this new 
U(1) Goldstone boson is expected to have a mass comparable to that of the 
pion because they all have the same quark composition. (This statement will 
be made more quantitative later on.) Experimentally no such isoscalar 
pseudoscalar meson has been seen. The n-meson has the right quantum 
number, but is simply too heavy. This is usually referred to as the ‘U(1) 
problem’ or the “‘n-mass problem’ (Glashow 1967). As we shall see, the 
existence of instantons will solve the problem (t Hooft 1976). There is 
another related U(1) problem that has to do with the decay n > 3m. We 
shall comment on that problem and its resolution later on. 


The n-mass problem. One may think that the presence of the ABJ anomaly 
in J? (which couples to gluons) will provide an escape from this paradox 
as the divergence of J? does not vanish in the m, a > 0 limit 


g’ 


oJ =4 162 


tr(G,,G"") + 2im,tysu + 2imadysd (16.80) 


where G,, is the gluon tensor matrix, G its dual, and we have undone the 
scaling of the field normalization of (16.17). (The factor 4 in front of the first 
term on the right-hand side comes from the fact that there are two flavours 
2N- = 4 in this theory.) However the matter is not so simple. We have 
already seen in eqn (16.24) that tr GG itself is the divergence of a current, 
g? 


K =4 
ži 16x 


tr GG” (16.81) 


with 
2 


K,=4 


772 Êuvå 
H 1672 HvAp 


tr G44? (16.82) 
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where A, = A{A*/2 is the gluon field. Thus we can define a new axial vector 
current 


Ji = J} — K, (16.83) 
which is conserved in the m,a > 0 limit, 
ô" J’ = 2im,dysu + 2imdysd. (16.84) 


Of course the current K,, and hence also J AE is unobservable, because it 


is not gauge-invariant. Nevertheless, because its charge 
O,= [Ji d°x (16.85) 


is conserved, this symmetry when realized in the Goldstone mode would 
demand the existence of an J=0 pseudoscalar meson with a mass mọ 
(Weinberg 1975) 


My < (/3)m, (16.86) 


To see this we can use the standard current-algebra technique to obtain a 
Ward identity entirely similar to eqn (5.228) 
me — k? 


mifi =i 4 — fie | d4x e* *<0T (0-75 (0) F5(x))|0) 


(0) 
+ | d+x e-**£0|6(x9)[6"F5(0), T310)! (16.87) 


where f is the isoscalar meson decay constant. After taking the k” — 0 limit, 
since there is no zero mass pole in the first term on the right-hand side, we 
relate m2f2 to a o-term which is of identical form to that for m2 f2 
encountered in §5.5. Thus, 

mofo = mfa (16.88) 


This reflects the fact that J a has the same commutator with quark mass terms 
as J}. To proceed further we can write (16.79) as a sum of an SU(3) octet and 
singlet 


1 2 
J} = wee + [ise (16.89) 
where 
l 


is RB (Uy,ysu + dy,ysd — 28y,758) 


AM £ : 
= | 3 Uyuysu + dyyysd + 57,755). 


To the extent that SU(3) is a good symmetry and all pseudoscalar octet decay 
constants are equal, we obtain immediately 


hon (16.90) 


1 
a 
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and hence the bound in eqn (16.86) especially if we allow for the possibility 
that there may be more than one isoscalar pseudoscalar meson coupled to 
J5. 

It was originally pointed out by Kogut and Susskind (1975b) that one way 
to avoid the disastrous conclusion of (16.86) is for J? to be coupled to a 
‘particle’ which is massless even if m,a 4 0. Then the first term on the right- 
hand side of (16.87) does not drop out in the k, > O limit and the simple 
relation between m2 f% and the o-term is spoiled and there is no restriction on 
Mo. Since J > is gauge-variant, it is conceivable that this gauge-dependent 
massless pole does not generate poles in physical (gauge-invariant) quan- 
tities. As we shall see, in °t Hooft’s resolution of the U(1) problem, the 
Kogut—Susskind mechanism is indeed realized in the new instanton 6- 
vacuum. The U(1), symmetry is spontaneously broken without generating 
the I = 0 Goldstone boson. 


The y — 3x problem. We should also remark that there is a second U(1) 
problem, related to n — 3n decay, which can also be solved by the Kogut- 
Susskind mechanism. We mentioned at the end of Chapter 5 that virtual 
photon exchange gives a vanishing amplitude for n — 3r in the chiral limit 
(Sutherland 1966). This decay must therefore be attributed to the isospin- 
violating quark-mass difference in the Lagrangian -density 


Lf = 4(m, — m,)(au — dd). (16.91) 
The leading term in chiral perturbation (Bell and Sutherland 1968) is then 


aL ——> (m -mAN YS (16.92) 


qa —>0 
with 
A = <nn|i(mūy;u + mıdy;d)n> 


= = <nm|d"J>(0)n> (16.93) 


where the two-pion system and n have equal momenta and energy. Thus 
even with m, 4 mg, we still have a vanishing n > 3m amplitude because the 
amplitude A in (16.93) is a total divergence between states with the same 
four-momenta. Again we can see a possible resolution of this second U(1) 
problem with the existence of a massless pole coupled in J > giving a nonzero 
A in (16.93). 


The -vacuum in the presence of massless fermions 


We now show how the structure of the 6-vacuum implies the spontaneous 
breakdown of the U(1), symmetry and how the associated massless particle 
decouples from physical quantities. Instead of following the original °t Hooft 
calculation in path-integral formalism, we shall only indicate the physical 
ideas with some heuristic arguments. 

Since Q, is not gauge-invariant, under a gauge transformation char- 


16.3 Instantons and the U(1) problem 491 


acterized by topological quantum number n (see eqn (16.64)) it changes (see, 
for example, Callan et al. 1976) as 


Os me 0; = T, OT, = Ô; + 2nNp (16.94) 


where Nç is the number of massless quark flavours. A simple check of (16.94) 
is to take the original ĝ, to be zero; then from 


ĝ; = [arse (16.95) 
with 
Neg? 1 
Ko = Sea Eoijk tt(A;A ;A;,) 
and 
A; = U`! 0,U 


we then have (16.94) when the above expression is compared with that for the 
winding number in eqn (16.16). Eqn (16.94) means that T, acts like a ‘raising 
operator’ of chirality—instantons ‘eat’ massless quark pairs 


[O;, T,] = 2AN; T,. (16.96) 
To be more explicit, 7, changes the winding number of the vacuum 
T,|0> = |n). (16.97) 
But 
O.|n> = 2nN-|n> (16.98) 


because of (16.96) and 0.|0) = 0. Thus the vacuum state with definite 
topological quantum number also has definite chirality (i.e. it is an eigenstate 
of O,). Since QO, is conserved 


[O;, H] = 0, (16.99) 


the vacuum-to-vacuum transition amplitude vanishes unless the initial and 
final states have the same winding numbers, 


<nlew my ~ Sim (16.100) 
or, more generally for an operator P,(x) with chirality 2Npv, 
Cne T P OAIM) ~ n-m, v- (16.101) 


Therefore, in the presence of massless fermions (i.e. O, is conserved), 
tunnelling between vacua with different winding numbers as discussed in the 
previous section is suppressed. 

Does this mean that the -vacuum is irrelevant in such a situation? Not so, 
as the |n> vacuum violates cluster decomposition. For example, in the 
vacuum state with n = 0, the expectation value for widely separated 
operators will not vanish 


COPIAR, ———_ COPS) m><m|P,(y)10> 


= COP\)|y><VIP,(V)10> # 0 (16.102) 
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because of the presence of ‘abnormal vacua’ with winding number n + 0. 
Eqn (16.102) also indicates spontaneous symmetry breakdown as the VEV of 
operators carrying chirality is nonzero. Thus the -vacuum is still relevant. 

What is the chiral property of the €-vacuum? Under a chiral rotation by an 
angle a, we have 


e~ i251 ae `y eTe- ern 
n 


= |0 + 2aN,), 


i.e. the 0-vacuum is changed to another one with 6’ = 0 + 2aNp. Thus the 
(gauge-variant) current conservation means that the theory is invariant 
under a rotation which changes its vacuum state |@) — |0’>. This implies that 
in the presence of massless fermions @ has no physical meaning: one @ is as 
good as another! 


Small 6-oscillations—the Kogut—Susskind pole 


The situation described above is very much like that we encountered in the 
simple U(1) 4d* theory with SSB of eqn (5.138). There, under U(1) rotations, 
the vacuum states can be changed into each other as indicated in Fig. 5.3. 
Thus we can interpret the parameter 0 in QCD as the parameter which 
characterizes the direction of symmetry breaking. However there is a crucial 
difference between these two spontaneously broken U(1) theories. In the U(1) 
theory of eqn (5.138) all the vacuum states connected by rotations are 
equivalent. In QCD the parameter 0 is like the coupling constant; different 
values of 0 correspond to different Hilbert spaces. In the U(1) theory of 
(5.138) small oscillations around the true vacuum in the angular direction (7) 
which do not cost any energy (i.e. zero energy excitations) are shown to be 
interpretable as physical massless particles (the Goldstone boson). On the 
other hand changes in the QCD 6-parameter are meaningless just as the 
pseudomomenta cannot be changed in the case of periodic potential. Hence 
small oscillations in the QCD @ correspond to an unphysical massless 
particle. But this is just the Kogut—Susskind pole which is required to solve 
the U(1) problem. 


The strong CP problem 


While the presence of instantons apparently resolves the U(1) problem, their 
presence also implies a 6-term in the effective Lagrangian (16.71) which 
violates P and conserves C (hence violates CP). In fact the stringent 
experimental upper limit on neutron dipole moment (see §12.2) can be 
translated into a bound on the QCD 0 parameter, 0 < 107’. This is the 
strong CP problem. How to give a rationale for such a small value, 
considering that it is a strong interaction parameter and a priori we would 
expect it to be O(1). One would think that the only plausible solution is that 0 
is effectively zero. Various ways to achieve this have been suggested; among 
them we list the following three approaches (see, for example, Wilczek 1978). 
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(The first two mechanisms both suggest that there may in fact be an exact 
global U(1) symmetry and the S-matrix, as we have explained above, is 
independent of 0 so that it can be rotated to zero.) 


(1) Zero mass quark. The exact U(1) symmetry comes about because one 
of the quark masses (presumably m,) is zero. However all current-algebra 
calculations indicate that this is an extremely unlikely possibility. 

(2) The axion solution. Increase the number of Higgs mesons so that even 
if m, a #0, QCD together with the standard electroweak theory still has a 
global U(1)pq symmetry (Peccei and Quinn 1977). However one of the 
scalars (the axion) plays the role of Goldstone boson for this U(1)pg. It is 
massless at the classical level and picks up a small mass (much like the n- 
meson) only through the axial anomaly and the instanton interaction 
(Weinberg 1978; Wilczek 1978). As the simplest version of this approach, 
constructed within the framework of the SU(2) x U(1) model, is ruled out by 
experiment, one is forced to postulate such a structure at the SU(5) GUT level 
producing an ‘invisible axion’ with infinitesimally small mass and couplings 
(Kim 1979; Dine, Fischler, and Srednicki 1981). However, this may pose 
problems for the standard theory of cosmology (see, for example, Sikivie 
1982b). 

(3) Soft CP. It has been suggested that perhaps one should set 0 = 0 asa 
symmetry requirement (‘strong interaction conserves CP”). However this in 
itself is not enough, since higher-order (CP-violating) weak interactions will 
generate y,-dependent quark mass terms. To eliminate them one has to apply 
chiral rotations which in turn induce a 8-term. To have a calculable and small 
(<10~°) 6,,, the CP violation in weak interactions must be soft (i.e. by 
operators with dimension less than four). So far no realistic model has been 
constructed. 

Thus we can see that none of the suggested solutions are completely 
satisfactory and the value of 0 remains a puzzle for QCD. For a recent review 
of QCD including topics such as instantons and the U(1) problem, etc. the 
reader is referred to Llewellyn Smith (1982). 

Finally, we comment very briefly on the role of instantons. For a very 
small coupling constant g, corresponding to very small distances r, tunnelling 
is negligible as indicated by eqn (16.63) and QCD perturbation theory can be 
used in appropriate circumstances as shown in Chapter 10. For some 
intermediate values of g one can continue to use perturbation theory but with 
allowance made for vacuum tunnelling. This produces (Callan, Dashen, and 
Gross 1980) a sudden rapid increase in g as r is increased from zero to a 
distance comparable to the hadron size (~0.5 fm). This is compatible with 
the phenomenological quark models and with the results of lattice gauge 
theory calculations discussed at the end of §10.5. For large distances there is 
at present no reliable method for calculating the effect of instantons (in fact it 
is not even clear whether they are relevant at all) although they do remind us 
that the QCD vacuum must be very complicated. 


Appendix A 
Notations and conventions 


Metric 


Metric t et es (A.1) 
etric tensor = = . ; 
A 0 0 -1 0 
0 0 0 =I 


Contravariant coordinate x" = (x°, x1, x”, x°) = (t, x, y, Z) = (t, x). (A.2) 


Covariant coordinate Xn = GuvX” = (t, —X). 
Scalar product A: B= A,B" = A,g""B, = AoBo — A‘ B. (A.3) 
Sus ô ô 0 Ô 
Derivatives C= T3 = (=. -v) c= oe E v) 
(A.4) 
where V= a a 2 , (A.5) 
ôx Oy Oz 
Ao 
Four-divergence OYA, = Tae + V:A. (A.6) 
Pauli matrices 
6 = (c;, 02, 03). (A.7) 
0 1 0 —i l 0 
ENE ok m o E CP) 
[o;, oj] = 2ie*o, e: totally antisymmetric «73 =1. (A.9) 
{Gj a;} => 20;; tr(o; d;) = 20;;- (A.10) 
Completeness: » (F;)av(Fidca = 2 (Ob. Ô ad Em 2 Oab Oca): (A.l 1) 
Dirac matrices 
tyes y} = 2g" Yu = Ip)” (A.12) 
y= (y°, ¥) Ya = Yo. —Y). (A.13) 
. i 
y? = ys =ipypyy Fwy = 5 Dw v1. (A.14) 
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Spin matrix si = feoi, i= 1, 2, 3. (A.15) 
Charge conjugation w= Ch Cy, Ci = -ye (A.16) 
Identity YarpVa = Gapla + IpaYa — Jarg + iEpapaY"Y5- 
(A.17) 
Fierz transformation 
Let Is = 1, Iy = Y, Ty = Cuv Ta = 7,95,Tp = Y5. (A.18) 
Then 2 gi idae i) = 2, GT aE i) yp (A.19) 
i j 


where the indices i, j run over the set S, V, T, A, and P and the g;s are 
related to ĝ; by 


gs l 4 12 —4 1 gs 
gv 1 -2 0 -—-2 -l1 gy 
Gr | =3 2 0 -2 0 2 JT |: (A.20) 
Gn {—1 -2 0 -2 l Ga 
Gp 1 —4 12 4 l gp 


Dirac representation 


o (1 0 (0 6 PN 
7 -( i 1=(°, 5) »-(; 5) oo) 


Hermitian conjugate 


=z GST = 7, (K=1,2,3) of, =o" yS=ys5. (A.22) 


0 
Spin matrix s = (o s) (A.23) 


Charge conjugate C = iy,. (A.24) 


Plane wave and Dirac spinor 


Incoming plane wave e * = exp[ —i(wt — k: x)]. (A.25) 

Outgoing plane wave e** = exp[i(wt — k-x)] (A.26) 
where œw = (k? + m”)!/?, (A.27) 

Space-time translation A(x) = e'?*A(0)e™'?* (A.28) 
where p“ = (H, p) is the energy-momentum operator. 

Klein-Gordon equation (ô? + u*)d(x) = 0. (A.29) 

Dirac equation (iy" ô, — m)p(x) = 0 (A.30) 
In momentum space (p — m)u(p, s) = 0, (p + m)v(p, s) = 0 (A.31) 


where u(p, s), v(p, s) are Dirac spinors. 
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Normalization 


u( p, s)u(p, S) = 2m o,,, v(p, slp, s) = —2m ô  (A.32) 


Projection operators 


È up, Dü p, 8) = (P + m) 


S 


È, Val P, S)0p(P, 8) = (P — Map. (A.33) 


S 


Gordon decomposition 


: 
u(p, s)y"u(q, s) = =— m AP» s)\L(p + q)" + io™(p — q),Ju(q, s) (A.34) 


2 
u(p, s)y"ysu(q, s) = —— vy APs s)\L(p — q)"ys + 10""(p + g)vyslu(q, 5). 


In the Dirac representation nn Dirac spinors are given by 


l , i 
amelie | sp Ves ps8) = (E+ m [zm J 
Etim (A.35) 


ce 1,2 


l 0 
where y; = o Y= a E = (pP? + m)’. (A.36) 


Normalization of states, cross sections and decay rates 
One-particle states are normalized as 
<p, alp’, «> = 2E(21)? 6°(p — Pp) bya’ (A.37) 


where a, «’ label the spin and/or internal symmetry indices. The projection 
for a one-particle state 


2 lp, «><p, a| = [a55 ag DIP. a>< p, al 
d“p F 
- fa J? o(p" =m )8(Po) 2, |p, &><p, aj. (A.38) 


The transition probability per unit time per unit volume for the process i > f 
iS 


wli > f) = (27) ôt (p; — p KETI? (A.39) 
where <f|7|i> is the covariant 7-matrix and is related to the S-matrix by 
CEIS|i> = dip + i(2)* 5°(p; — p)<FITHD. (A.40) 


Decay rate 


The transition probability per unit time per unit volume to a specific final 
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state f is given by 
_ w(i > f) 


dr (i >f) dN, (A.41) 
Where p; = 2£, is the density of the decaying state and 


n 


d3p. 
I] Pj is the density of the final state. 


dN, = T —— 
© jy (22)°2E, 


The total decay rate is 
ra> n= | pt nyt o¢(p, — E p \Keiripes (A42 
2E j=l (2n)°2E, i j=1 : | 


where S is the statistical factor which is obtained by including a factor 1/m! 
if there are m identical particles in the final state, 


1 
© [T aa (A.43) 
Scattering cross-sections 
The cross-section for a; + a, > f is given by 
f 
do(a, + d2 — f) = olar TA dN; (A.44) 


where J; is the flux of the incoming particles and is given by 
J; = Pipav (A.45) 


where p;, p2 are the densities of the initial state and v is the relative velocity 
between two particles in the initial state. The flux factor J; can be written in 
the centre-of-mass system as 


J, = 2E,2E, | — 2) = 4p, E — pil = 4p: (E, + E) 
1 2 
= 4[(p1 P2) — mimi] ” (A.46) 


Note that J; in eqn (A.46) is expressed in terms of the Lorentz-invariant 
quantity. Hence it is valid in any other frame. 
The cross-section is given by 
_ (n)* (py + Pa — Pr) " dp; 


KAT? [I 


g = eos aS (A47) 
4[(p1°p2)* — mimi]? jai (22)°2E, 


Appendix B 
Feynman rules 


In field theory, the standard procedure for calculating any physically 
interesting quantities in perturbation theory is summarized in a set of 
Feynman rules. Efficient use of these rules will greatly simplify the 
calculation. The most important parts of the Feynman rules are the 
expressions for the propagators and vertices, which characterize the structure 
of the theory. In this appendix, we will outline a practical method for 
deriving the propagators and vertices for the most general situation. We will 
also summarize the results for the physically interesting cases at the end. 


A practical guide to the derivation of Feynman rule 
propagators and vertices 


The 4o* case 


We will first discuss the simple case of A@* theory for the spin-0 boson and 
then generalize the result to more complicated cases. When the 1¢* theory 
Lagrangian is divided into the free part Zo and the interaction part Z 


Lo (p) = 26,6)" — 2H" (B.2) 
A 
AG) = -7, 4, (B.3) 


the generating functional for the Green’s functions is given by 
WLJ] = [ras exp fi | LZA) + Alo) + Ja)O)] ats| 


0 
a apli [2 (-i a] a*s| AGA (B.4) 


t] 


WoLJ] = | [de] exp{i GEZE + Js) (B.5) 


where 


Since the free Lagrangian Zo given in (B.2) is quadratic in the fields, the 
integral in eqn (B.5) can be evaluated as follows. Write (B.5) as 


WoLJ] = [rag exp | 0o) — u O + JE) ats| 


= | [dg] exp | BOPO) + JOO) ax} (B.6) 
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where 
P(x) = —(0? + p? — ie) (B.7) 


is a Hermitian operator. In (B.6) we have inserted a factor 
exp{ —4 | ef? d*x}, with e > 0, to make the path integral convergent. Let 
@.(x) be the solution of the classical free-field equation in the presence of the 
external source J (x) 


Pex) = —J(x). (B.8) 


Using the usual Green’s function technique, this equation can be inverted. 
Defining A(x) by 


P(x) A(x — y) = 64x — y), (B.9) 
we have 
d(x) = — | ae — y)I(y) d*y. (B.10) 
It is easy to see that for the P(x) given in (B.7), we have 
dtk 
A = —ik-x , 
F(x) (se € Ar(k) (B.11) 
with 
l 


We now change the integration variable in eqn (B.6) from (x) to $’(x) 
defined by 


P(x) = Px) + G'(X). (B.13) 


Then (B.6) becomes, as in the case of usual Gaussian integrations (see eqns 
(1.49) and (1.81)) 


WolJ] = N epi z. | EI) Arx — y) J(y)] d*x ay} (B.14) 


where the normalization factor 


N= | [d¢’] exp | IPAP) a*s| (B.15) 


is independent of the source function J(x) and is irrelevant for calculating the 
connected Green’s function. The generating functional is then given by 


W\J| = Nexpfi fatx j2, (2) ) expd —5 [axd txa) 
x A(x; — aa) (B.16) 


and the Green’s function can be obtained from 


ô" WLJ] 


COITA) -ADIO = D S — Gay 


(B.17) 


J=0 
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Using the rule for functional differentiation, 
dJ(y) ô 
Ôx  dJ(x) 
it is not difficult to see that the functional differentiation in eqns (B.16) and 
(B.17) will reproduce Wick’s theorem. 
The connected Green’s functions can be obtained by differentiating the 
In W[.J] functional (eqn (1.76)) 
in ô" In WEJ] 
OJ(x,)... OJ(X,) 


[æo — x)J (x) d*x = 6*(x — y) (B.18) 


J=0 7 Se E Xn) = <OIT(P1) i P(Xn))19> conn: 
(B.19) 


The free propagator, a two-point function, zeroth-order in the coupling 4, 
can be obtained from In W,[ J] as 


„ô In MLJ] 
ôJ(x) dJ(y) 
To get the basic vertex in A¢* theory, we take the four-point function 


G(x, ... x4) and keep terms lowest order in A. After some algebra we 
obtain 


COITA) O tree = 1 = iAp(x — y). (B.20) 


J=0 


G(x, sare X4) = —ih [iare = xyAr(x2 = x) 


x iAr(x3 — x)iAp(x4 — x) d*x (B.21) 
or, in momentum space, 


Gk, ... k4) = iAp(k, iAp(k2 )iAp(k3)iAp(k4)(—iA) (B.22) 
where 


(2n)* 6*(k, + k, + k3 + k,)G(k, ae k4) 


4 
= | d*x, ce Gx]... x4). 
i=1 
(B.23) 


After removing the propagators for the external lines, the four-point 1PI 
function (the vertex) is 


rk, k,, k3, k4) = —iÀ. (B.24) 


To summarize, in 1* theory, the propagator (B.12) and the vertex (B.24) 
are given by 


k i 
7> pF (B2) 
Se ye 
X ih (B.26) 
va N 


Note that the propagator in momentum space is just the Fourier transform 
of the inverse of the operator P(x) which appears in the quadratic term of the 
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free Lagrangian (B.6). The vertex is just the coefficient of the interaction term 
multiplied by the factor i and by the number of permutations of identical 
particles in the interaction term. 


Generalization 


To generalize the above result to the case of more complicated interactions, 
consider the Lagrangian 


L(x) = zox) Pp) + xV) + W(X) X,(x)W (0) 
A LQ, Xs ya y, Y) (B.27) 


where @,(7;) denotes a set of real (complex) boson fields that may be scalar or 
vector fields and y; the set of fermion fields. The index i stands for any spinor, 
Lorentz, isospin, etc. index. P, V, and X are matrix operators that may 
contain derivatives and must have an inverse. X and V are taken to be 
hermitian operators while P is taken to be a real symmetric operator 


Xt =X, V=V, and P =P. (B.28) 


It is understood that they contain the ie factor so that the path integral will be 
convergent. A general term in Z (x) has the explicit form: 


FL, = [ats dts .. 


ol sie: (NG Nc ae Rene weiss 


iy ...im...in...tp. 


X Wi(%1) oo Win (Xm) «++ Pi (Xn) ++ VE) 20) 


(B.29) 
Define the inverses of P, V, X by 
5 P, (x) Pj (x — y) = ĝi ôt(x — y) 
j 
> V; (x) Va (x — y) = ĝi ôt(x — y) 
j 
X XOX ju (x — Y) = ôu O° — y) (B.30) 
j 


and their Fourier transforms 


i LE 
Py'@)= | oN 


dtk gap- 
Vij *(x) = lowe en “Vi *(k) 


XG) = lose eH RA 1(k), (B.31) 
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Then the propagators are given by 


j- Av, - | dx e= (OT(b,00)6(0))10) = iLP- 200], 
j-- = —--i A(k);; = [ats e*£0|T(x*(x)x,(0))10> = ILP] 


pe Selk); = | d*x e™*= <0 TO: CAY; = iL E]; 


(B.32) 
For the vertex we define 
d*k, d*k, ik,(x-x iko(x-—x 
tie Oia) = [ERE EEE i Naa ca 
x Oris... (Kis Kas). (B.33) 


& contains a factor ik, for every derivative d/dx,; acting on a field with 
argument x;. The vertex is then given by 


(Cae ee nD 


{1...m—1} {m...n=1} {n...p—1} {p...q—1} {q...} 
x (— 1); (Kis ko»): (B.34) 


The summations are over all permutations of indices and momenta as 
indicated in Fig. (B.1). The momenta are taken to flow inward. Any field ọ 


ik; 


inka 


Fic. B.1. 


(or y*) corresponds to a line with an arrow pointing out; a field w (or y) has 
an oppositely directed arrow. The @ fields are represented by undirected 
lines. The factor (— 1)’ is only of importance if several fermion fields occur. 
There is a factor of (—1) for every permutation exchanging two fermion 
fields. 

For example, in Ag* theory the interaction term can be written 


— — t(x) = E fas ... d*x, t(x — x1) O4(x — x2) 


x t(x — x3) 08x — x4)b(%1)P(%2)O(%3)O(%4) (B.35) 
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Then 
A c4 4 
AX; Xy cc ky) = ae (x —xX,)... 0°(% — x4) (B.36) 
and 
3 À 
ak, eee k4) = AL (B.37) 


The vertex is given by 


Ik, ... kg) =1 D Alki, k2, k3, K4) 


{1,2,3,4} 
=f Gaa (B.38) 
4) 


This agrees with the result (B.24) obtained before. 


General Feynman rules 


Given the propagator and vertices we can now write down the Feynman rule 
for the computation of Green’s functions. 


(a) Draw all topologically distinctive, connected diagrams at desired 
order; 
(b) In each diagram, attach a propagator to each internal line 


—_——.——— for a spin-0 boson 
k? — p? + ie ai 
i 
p= C-i) for a spin-1/2 fermion 
P — m — lEjap 
Spin-1 boson propagators will depend on the theory; 
(c) To each vertex, assign a vertex function given in (B.34), derived from 
the relevant term in the interaction Lagrangian; 


(d) For each internal momentum & not fixed by the momentum conserv- 
4 


ation at vertices, give a factor . 
: (a 


(e) Multiply the contribution for each diagram by 
(i) a factor (— 1) for each closed fermion loop: 
(ii) a factor (— 1) between graphs which differ from each other only by an 
interchange of two identical external fermion lines; 
(iii) a symmetry factor S~* with 


S=g [| Wap 
n=2,3... 
where «, is the number of pairs of vertices connected by n identical self- 
conjugate lines, p is the number of lines connecting a vertex with itself, and 
g is the number of permutations of vertices which leave the diagram 
unchanged with fixed external lines. 
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For example, 


—< > g=1,«, = 1, ß = 0, and S = 2! 


—<>— g = 1, «3 = 1, p = 0, and S = 3! 
$ g=1,«,=0,ßf=1,and S=2 


SIX è g=2a=1,ß=0,andS=4 


(£) The proper (or one-particle irreducible) Green’s function '(p, ... p,) 
comes from the one-particle irreducible (1PI) diagrams 

(i) For the connected Green’s function G(p,...p,), attach propa- 
gators for the external lines; 

(ii) For the scattering amplitude 7(p, ... Pa), put all the external lines on 
their mass-shell, i.e. p? = m? and provide external fermion lines with 
spinors: u(p) [or v(p)] for fermions [or antifermions] entering with 
momentum p; u(p) [or o(p)] for fermions [or antifermions] leaving with 
momentum p. Provide external vector bosons with polarization vectors: 
é,(k, A) [or ef(k, A)] for the vector boson entering [or leaving] with 
momentum k. 


Summary of 107, Yukawa, QED and QCD propagators and 
vertices 


(a) Abt and Yukawa theory of scalar—fermion interactions. 
l Bob eh A aa ae oiT 
L =5 (0,6) -5P — 5 o* + Iy — my + igh 


scalar propagator: ~~ Da IAR(k) = 


fermion propagator: &——?——~4@ iSE(D)ap = (5) 


i . SNO ee 
vertices: so iA NË (15 ag 
2 NG B 


(b) QED. 


l l = 
L = = (0,A, — ô, An) — gg CAY + wiy:d — ey: A — m)w 


i 
Fermion propagator: 8b——>—— « | 1S;,(p),, = | ——————- 
propag p P(P)ag a). 


Appendix B Feynman rules 505 


v : =o 
Photon propagator: et eas iD-(k),y = ae [ony + (Ë — 1)k,„k,/k?] 


Feynman gauge 


C= 
č = 0 Landau gauge 
H 


Vertex: Bx —1e(Yy)ap 


B a 
(c) Scalar QED. 


1 l 
L =~] yA, = 0,4, — zg 0,4") 


+ (0, + ied, DETTO + ie] — H'O -$ (64). 


; i 
Scalar propagator: =~=>---  iAp(k) = eoe 


Photon propagator: S" iDp(k), = ao Lg., + (E — 1)k„k,/k?]. 


€ = | Feynman gauge 
č = 0 Landau gauge 


u 
: N “ 
rtices: = RASSA È -l 
Vertic m P Mu po 2ie* guy C —id 
N ri 


(d) Non-Abelian gauge theory. 
l 
F = — 7 (uA — ÔsAp + GCP ALASO A” — 0A“ + gE At Ae’) 


l 
“26 
+ Piy ð" — igA“T*) — mly 


(0,A%)? er 0 ,(0" ô — gC™ A )y° 


À 
+ [O — igA™ RTT, — igALR”)A] — m3o'd — F (0'9). 
Vector meson an” DE yy = pee [on + (E Dkk, /k?] 
propagator: “Ok i "Y k? tie” j 
€ = 1 Feynman-~’t Hooft gauge 
č = 0 Landau gauge 


Fermion propagator: ene iS (p) = O i i 
E E — m + iE jag 
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id!” 
Scalar propagator: diya ROR tacos lO = 
propag came iAf (k) 2a 
Ghost propa ator: pf vcccccces poecccces a iA2(K) E a 
~ i en’ ke? + ie 


Vertices: igC [(ky—ky)y Bayt (Ky Ky Ba tlk — ky) Sua] 


Ac 


u,a , 
—ig? bes? Ci Ba gis ZE up 8vA) 
+ C8" Ce (SupZav ~ Suv Zap) 
+844 CC (g uv 8pà—guàgpv) 
p.d À,c 
u,a u,a 
; gc% p, 18% ag (T); 
qaa yD l 
7 ° 
c b BJ œ, 
H.a 
u,a v,b 
; ig(pt+p')y Rim AF aie: (R°, Rim 
7 NW ! S N 
Pa xP A N 
7 N / N 
m l m l 


For QCD we have 
CO a Fee with a, b,c = 1, 2,...,8 
and the representation matrices 
(T°); = (A*/2);; with 7, 7 = 1, 2, 3. 
(R*) in = 0 


R: gauge Feynman rules for the standard SU(2) x U(1) 
theory 


Since these rules are slightly more complicated we provide some steps in their 
derivation. 


The gauge-fixing and FP ghost terms in the effective Lagrangian 


After spontaneous symmetry breaking the covariant derivatives of the scalar 
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field give rise to mixing terms between vector bosons and scalar particles 


T 


B t 
(D,)'(D"#) = (6, — ig Ay — ig Sle + <H>) | 
Bt 
x IG — ig 5 AY — ig’ +e + 0o) 
- a| -a0 5 <b)0 + <d")05 a | 


B" 
+ ig’ = Koo up — Aub" OA] +- (B.39) 


For practical calculations, it is more convenient to cancel such mixing terms. 
For this purpose we choose the gauge functions (9.27) to be 


fi = ôA} + e(o" + <#>0— <0 v) for SU(2) 


f = 0,BY + ig’ : (#"<oy. = <g> ) for U(1)  (B.40) 


so that the gauge-fixing term (9.68) is of the form 


l , 2 
L of = TJE [aa + e(o" 5 <d>0 — <b > ) 


ar opm +e (0o - w) . (B41) 


To calculate the Faddeev-Popov ghost from these gauge conditions, we 
make the infinitesimal SU(2) gauge transformation, with gauge function 
u(x), 


; e l À 
6A‘ (x) = u(x) A(x) — r 0,u'(x) 


Sb(x) = -i5-ulx) (x) 
or 


5p’ = -iz ug + (po). (B.42) 


Then 
Of; = Ô, hu = - a, | E ueo" + (o"o)i su S <o 
Ho iz ue + Ao) | 


if =iF G + LPi 5 Ww HY0 + <boi 5 UCG’ + <o>) | 
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or 


of, = |- : O"[0i; On = ij A] 
g 
l T; H 
-g| "> 5 <b)0 dy + OE hyo + Oo alc 


f= =F e| eao + 6°32 yo + CE ag (B.43) 


Similarly under the infinitesimal U(1) gauge transformation, with gauge 
function a(x), 


5’ = -i5($' + <o), (B.44) 
we have 
of = — FELCH" Dom) + HtHOYo 
+ <b" otp Jo 
if = LI + T KADA + Oo 
+ Ooo Ia (B.45) 


We can combine (B.43) and (B.45) in matrix notation 


5 = as Y)ij a (B.46) 


(x) Mx, y); Mix, y) /\aly)g 
where 
M;(x, V)ij == -doa On = g& AX] + pe RL Oij aE p ai [Po 


+D al ix») 
M,(x, y); = = KZ + p” 4 <P>o + <b'>o a a 
x 54(x — y) 


M,(x, y) = -f7 + 7e [21 bol? + 6" b>0 + Zi ô*(x — y). 
(B.47) 
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The FP ghost-field Lagrangian (9.69) is then given by 


M,(x, Vij M ,(x, dd wag 


ZL ppg = [ats diyor), x'09) fe y); M(x, y) Ax) 


(B.48) 


Propagators and vertices for bosons and FP ghosts 


Define the physical vector bosons as 


l 
Wi=— 


Z,, = cos OwA? — sin OwB, 


(Al, FiA?) 


u 


A, = sin 0wA? + cos OwB, 
and write the scalar mesons as 
pr 
p= | 
ı + id2 
Re 
For the ghost fields we can define similar combinations 
wt = (w, F iw) 
Wz = cos byw; — sin Owy 
w, = sin Oww, + cos Owx. 
With these definitions, we can easily work out the propagators from the 


quadratic part of the Lagrangian Z, + L3 + La + rro in eqns (11.37), 
(11.59), (B.41), and (B.48) 


+ ==] 
Ps 
WT ANANA K-M tie [guvt (6—1) ky ky/ (k= EM} )] 


Z ANANN —_—— AEDk kk = EM? 
KM vie De tE- Dkk (k? — EMD) 
ee es O 
k?— M}, +ie 
i 
í k? — EM3 + ie 
1 
(ieee k?—2pu?+ie 
E E T =i 
k?—¿M}+ie 
G) *oecooccoccosoococooooe —i 
i k?— EM} +ie 
panai =l 
k?+ ie 


where € = 1 °t Hooft-Feynman gauge, č = 0 Landau gauge, and č = œ 
unitary gauge. 
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The boson vertices are 
A,(k;) 
—ie[ (kK, ky) Sy + (kK), Byatt (kg ky Bay] 
Wy (k) Wz (ks) 
Z, (ky) 


—ig COS Oy [(k; =k); Buy tkrk )u8va tks — ky), 8 ay] 


Wt (k,) W; (k3) 
+ + 
wi wt A, A, Z, Z, 
ig? SuyAp —ie* Swap —ig? cos? Oy S UYAP 
Wy W; wi Wa Wi W> 


as well as the AZWW vertex —iegcos@,S,,;. with Syyrp = 29uGip— 
9uAGvp—Jnup9va- In graphs below all charged boson lines are taken to be entering 
into the vertices. 


~ “XN 
wet S 
$ p_) p (p4) 
A Z 
. [1—2sin ðw g 
ge | 2cos Oy, | PTBA 2 cos Oy PP), 
sA As pea ~~ 
1 N 1 Ny 
$ (P) $* P4) $ip) $02) 
W, Wy 
ig 
3 oe (+P), 3 > (P4—P2), 
wae TRL P E, 
$i(p,) +p) $P) tpa) 
wi wi 
a 8 (\—P_), 3 m W—P-), 
pr Tike, we Su. 
$P) $ (p_) $(P2) $~ (p_) 
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$t 
| 
| 
| 
a ih = 
Au Wy 
$, 
| 
| 
Poe i 
Wi Wy 
A, A, 


/ N 
Pa x 
pb 7 
S i 
VA Xs JS 
\ 
ye i Fa 
$, $, $» 
Au Wo 
rn a 
y \ 
( Ss 
$ $ 
Au Wy 
see 
Yairi 7 Suv 
Fi \ 
J, x 
L W 
$ $ 
Lu Wy 


eR tan Ow Sy 
\ 
/ $ 
/ N 
$ $i 
Wa Wy 


Zi Zy 
Zi Zy 
7 Xe 2 cos? Ow ay 
Z 
x R 
/ a 
p $ 
Zy 
. 2 
1g 
7, SHY 
2 cos* By, 
N 
Si 
p 
Au WwW, 
ge 
Z — Bw 
N H 
P S 2 
ie oN 
gr $, 
l ‘i, 
ge 
F a F tanOy g 
Z N 
A n 
$ $, 
Fa ZEE tan Ow Zv 
/ SR 
/ N 
+ 
$ $a 


d 
| 
| 


Z; Wy 
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Inclusion of leptons and quarks 


Propagator: À AEE ; 


Vertices for leptons: l = (e, p, T), Vi = (Ve, Vu, Vx) 


for quarks q: p = (u, c, t), n = (d, s, b) with the CKM mixing matrix U,, of 
eqn (12.39). 


H 
— ie Y, pe Qq Yu 


l q q 
w7 W; 


ig a 
Jus = g pon Y) Upa 
Y, (1— Ey [(1—¥ sin’ey)—¥s) 
oe a 4 cos Oy 


v p p 


= - 2 ig 
Jat cos 6,, Yall 1+ 4sin Ow) +75] AS Pal Fes zsin 2 Ow) + ys] 


4- 4- 
| 
| 


' 
rns M, ae 


v 


Mt" (i—-¥5)—m, (It+y5)] U 


v p n 
f f 
| —igm, | —igm | —igm, 
l l n n p p 
f i $2 
m gm 
| lil Y5 | P Ys 
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discovery 280 
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axial-vector current Ward identity anomaly 173- 
| 82; see also anomaly 
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BPH renormalization 39-45, 56, 67 
BRS transformation 273-6 
baryon masses 119-20, 165-6 
baryon number 
asymmetry in the universe 444-6 
baryon minus lepton number, see B — L 
conservation 
in QCD 292 
in standard electroweak theory 355 
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Bjorken scaling, see scaling 
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Kobayashi—Maskawa theory 359-61 
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Cabibbo angle 136, 210, 337, 356, 371, 373; see 
also Kobayashi—Maskawa mixing 
Cabibbo—Kobayashi-Maskawa (CKM) matrix, 
see Kobayashi-Maskawa mixing 
Cabibbo universality 373 
Callan—Gross relation 207-8 
Callan-Symanzik equation 70-7 
canonical commutation relations 4, 248 
canonical dimension 58-9, 61, 77; see also anom- 
alous dimension 
canonical quantization formalism 3-11, 255 
Casimir operator 91, 408 
charge conjugation 412, 429, 445-6, 448; see also 
Majorana fermion 
charge quantization 346 
grand unification 430-1 
magnetic monopole 455, 469 
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Zweig rule 
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algebra 132, 136 
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Coleman—Weinberg mechanism, see radiative in- 
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singularities 
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anomaly cancellation 348 
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fractional quark charge 431 
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colour flux tubes 323 
composite field operators 62-6, 73, 221 
mixing under renormalization 65-6, 307-10, 
314 
composite Higgs particle 401-2 
condensate, see vacuum expectation value 
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and asymptotic freedom 322-3, 335 
criteria 331-3 
linearly rising potential 323, 333-4 
Wilson loop 331-3 
conserved vector current (CVC) hypothesis 136 
cosmology; see also baryon number asymmetry, 
cosmological bound on neutrino masses 
closed, open, flat universe 415-16 
critical mass density p, 415-16 
Hubble’s law and constant 415 
primodial 
blackbody photon radiation 415-16, 445 
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neutrinos 416 
coupling constant; see also asymptotic freedom, 
gauge field coupling, pion—nucleon 
coupling 
definition 36, 52, 59, 81-2 
dependence on subtraction point 81-2 
dimension and renormalizability 57 
effective 75-6, 81-5, 295-6, 437-41 
renormalization 36-7, 285 
‘unification 437-41 
counterterm 
BPH renormalization 40-5, 58—60 
composite operator 64-6 
gauge theory renormalization 267-8 
Goldstone boson mass, absence of 186-7 
minimal subtraction scheme 78-80 
soft symmetry breaking 186-8 
covariant derivative 230, 232, 233 
as connection 236 
SU(2) x U(1) 348 
SU(5) 439 
covariant gauges 257-67; see also Landau gauge, 
Lorentz gauge, R; gauge, °t Hooft- 
Feynman gauge, unitary gauge 
currents | 
anomaly 480, 488-9 
axial-vector 173-4, 223, 488 
commutator 130, 136, 200-1, 223-5 
physical 
electromagnetic 134, 200, 222, 340 
weak 
charged 135-6, 204, 337, 340, 353 
neutral 336, 342; flavour-conserving 353- 
4, 362, 364-71; strangeness changing 
337, 362-3, 371-2, 378-83 
symmetry (Noether) 127-8, 134 
topological 463 
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current algebra 125, 180, 279, 488 
Adler’s sum rule 137-41 
soft pion theorems 151-67 
Adler’s consistency condition 155, 161 
Adler—Weisberger sum rule 158-60 
n > 3x problem 167, 490 
Goldberger-Treiman relation 153-5, 160, 
161 
n° —> 2y 180-2 
nN scattering length 159 
o-terms 157-9 
nN o-term 161, 165-6 
pseudoscalar meson masses 119, 164; 
Dashen sum rule 167; y-mass problem 
488-92 
current commutators 130—2, 136, 201, 223-5; see 
also current algebra 
current conservation 127, 144, 170, 174, 201 
symmetry conservation laws, see Noether 
theorem 
topological conservation laws 463—4 
current-current interaction 
baryon number nonconserving current 443 
charged weak current, see V—A interaction 
neutral weak current 354, 365 


d% symbol 163, 432 
Dashen sum rule 167 
decoupling of heavy particles 409, 436, 437-41 
deep Euclidean limit 
asymptotic behaviour of Green’s functions 73- 
7, 81, 280 
deep inelastic JN scattering 199-214, 218-26 
deep inelastic scattering 84, 199 
dimensional transmutation 196-7 
diagonalization of mass matrices 357-9 
diquark 442; see also leptoquark 
Dirac monopole 453 
quantization condition 455-60, 469 
and angular momentum 455-6 
string 456-60, 470 
and ’t Hooft—Polyakov monopole 469-71 
Wu-Yang formulation 460 
Dirac quantization condition 455-60, 469 
Dirac string 456-60, 470 
direct product group 87 
dispersion relation 159-60, 301-2 
divergences 30, 33 
disjoint 45 
index of 57, 59, 188 
linear 34, 176 
logarithmic 33, 34, 176 
nested 45 
overlapping 45 
power counting method 56 
primitive 33, 44 
quadratic 34, 42, 50, 437 
superficial degree of 40-1, 43, 57, 73 
double B-decay, neutrinoless 414 
dual field 454, 480 
self dual field 481 
duality transformation of Maxwell equation 454 
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Drell~Yan processes 
quark—parton model 216-18 
perturbative QCD 322 
W and Z production 394 
dynamical fermion masses 168, 408: see also 
constituent v. current quark masses 
dynamical symmetry breaking 401-9; see also 
composite Higgs particles, technicolour 
e*e” annihilation; see also forward—backward 
asymmetry in e*e7 > up , quarkonium 
jets 216 
total hadronic cross-section 
colour 216 
free field singularities 223-4 
QCD corrections 296-8 
quark—parton model 215-16, 291 
ratio R 215-16, 224 
effective potential formalism 189-91 
eightfold way 90, 114, 116-21 
electric charge; see also charge quantization 
as gauge coupling 230 
in electroweak theory 340-2, 346-8, 353 
and Weinberg angle 353 
electric dipole moment 385-6, 492; see also CP 
electromagnetic current, see currents 
electroweak unification 336, 339-40 
-mass problem 488-92; see also axial U(1) 
problem 
n — 3n 167, 490; see also axial U(1) problem, 
isospin violation by strong interactions 
energy-momentum tensor 309-10 
Euclidean field theories 
fermion 328 
gauge theory 479 
scalar 326 
Euclidean gamma matrices 328 
extended technicolour 406-9 


f% symbol 98, 131-2, 163 
factorization of singularities 296, 298, 315-16, 
321; see also operator product expansion 
Faddeev—Popov 
ansatz 254 
determinant 253-4, 257-8 
ghosts 258, 261-3, 269-73 
absence of ghosts in Abelian case 257; in 
axial gauge 255, 273; in lattice gauge 
theory 333 
BRS transformation 274 
loop 263, 270, 285 
and unitarity 273 
Ward identities 272, 277 
Fermi constant 135, 204, 336, 339, 353, 373 
gauge boson mass 353, 355 
Fermi theory of B-decays 336 
Fermion family replication problem 363, 429; see 
also generation 
Fermion masses; see also chiral symmetry, quark 
masses, vacuum tunnelling 
Dirac and Majorana types 412-14 
mass matrix 357-9; see also biunitary trans- 
formation, mass eigenstates 
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in theories with spontaneous symmetry break- 
ing 151 
in Weinberg-Salam model 351, 357, 395 
in technicolour theories 405—9 
in minimal SU(5) model 449-51 
ferromagnetism 142-3 
Feynman parameters 46 
Feynman rules 3—4, 22, 498-512 
Abelian R; gauge 264-5 
covariant gauges 259-63, 504—6 
fermion loop 23, 29 
Agd* 10-11, 183, 186, 498, 504 
path integral quantization 498-503 
QED 504-5 
R; gauge Weinberg-Salam theory 506 
scalar QED 505 
Fierz transformation 354, 382, 443, 495 
fine structure constant (at M,) 388, 438 
fixed point 82-3; see also renormalization group 
form factors 153, 202; see a/so structure functions 
forward—backward asymmetry in e*e” > utu” 
368-9 
four-fermion effective Lagrangian, see current- 
current interaction 
functional integration 3, 12, 23 


GIM mechanism 337, 262-3, 371-2, 378-83, 420, 
427; see also weak neutral current, charm 
mass insertion 381-2 
power v. logarithmic suppression 382-3 
ga 153, 160 
GUT, see grand unification 
gauge; see also Euclidean gauge theory, Higgs 
phenomenon, lattice gauge theory, pure- 
gauge, instantons, solitons 
coupling constant 230, 232, 234-5; see also 
electric charge, coupling constant 
unification 
field 230, 232, 234, 238, 240 
couplings 
fermion 231, 263 
ghost 262 
self 234, 262 
transformation 
gauge 230, 232-4, 479 
singular 457-9 
topological properties, see instantons, 
solitons 
fixing conditions 242, 248, 252, 258-9; see also 
Ay =0 gauge, axial gauge, covariant 
gauge, renormalizable gauges, unitary 
gauge 
absence in lattice gauge theories 333 
parameters č 259, 261, 264-7, 274, 285-6, 426, 
505-9 
symmetries (interactions) 
Abelian 229 
non-Abelian (Yang-Mills), 231 
eigenstates 356, 371, 410, 430, 447; see also 
mass eigenstates 
theories, see QCD, QED, QFD, Weinberg- 
Salam theory, GUT 
history and origin of name 235-6. 
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Gaussian integral 13, 19, 249, 256-8; see also 
Grassmann algebra 26-9, 258 
Gell-Mann-Nishijima relation 115, 135 
Gell-Mann—Okubo mass formula 119-20, 164, 
166 
generating functional 16-21, 23, 29, 189, 249, 
255-61, 275; see also vacuum-to-vacuum 
transition amplitude 
generation 340, 355-63, 429, 447; see also fermion 
family replication problem, kinship hy- 
pothesis, interfamily couplings 
generator (symmetry group) 88-90; see also 
Noether theorem, SU(2), SU(3), SU(n), 
SU(2) x SU(2), SU(3) x SU(3), SU(5) 
geometry of curved space 236-8 
Georgi—Glashow models 
SO(3) 341-2, 466-72 
SU(5) 428-52, 472-5 
Ginzburg—Landau theory 143 
Glashow—Weinberg—Salam 
Weinberg—Salam theory 
global symmetries 86, 125-8; see also B, B — L, 
chiral symmetry, lepton number, muon 
number 
gluons 124, 208-9, 214, 292, 312-14, 322-3, 331, 
404-5, 432 
Goldberger—Treiman relation 153-4, 160 
Goldstone boson 144-5; see also pseudo- 
Goldstone boson, would-be-Goldstone 
boson 
in Abelian case 147-9 
and Higgs phenomenon 243 
Kogut—Susskind pole and axial U(1) problem 
490-2 
in o-model 150 
in strong interactions 151-2 
and technicolour idea 403-5 
Goldstone theorem 144-52, 240 
Abelian symmetry case 147-9 
non-Abelian symmetry case 149-51 
grand unification 428-52, 472-5 
baryon number nonconservation 442-6 
charge quantization 430, 472 
coupling constant unification 437-42 
gauge hierarchy problem 402-436 
lepton—quark mass relation 446-51 
monopole 453, 472-5 
Grassmann algebra 23-9, 274 
Green’s function 8—10, 14-20, 189 
amputated 10, 36 
asymptotic behaviour 73-7 
composite operators 62-5, 70, 73 
connected 8—9, 18, 20-1, 36 
Euclidean 16-17 
one-particle-irreducible 31, 36 
renormalized v. unrenormalized 39, 171 
Gross—Llewellyn-Smith sum rule 217 
Group theory 86-90 
group parameters 88, 90 
rank 90 


theory, see 
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Heisenberg picture 6, 11, 126 
helicity amplitude 205, 208, 343 
Higgs particle 243, 244, 263 
elementary v. composite Higgs 401-2 
composite Higgs (technicolour models) 402-9 
charged Higgs 400 
triplet Higgs 418-19, 466-7 
in Weinberg—Salam theory 351, 355, 394—400 
couplings to fermions 349-50, 357, 395 
couplings to gauge bosons 390, 395 
mass 351, 396-8 
production 398—400 
Higgs phenomenon 240-7 
Abelian case 241-3 
non-Abelian case 243-7 
in Weinberg—Salam theory 348-52 
in SO(3) model 466-7 
in SU(5) GUT 434-7 
homotopic classes 477 
S! — St 477-8 
S? — S? 467, 476 
S? —> S? 478-9 
hypercolour, see technicolour 
hypercharge 115 


infinite momentum frame 140 
infrared divergences 316-17, 321 
instantons 292, 476-93; see also axial U(1) prob- 
lem, axion, anomaly, homotopic classes, 0- 
vacuum, strong CP problem 
and path integral formalism 484, 487 
size 481 
vacuum tunnelling 482-7 
interaction picture 6 
interfamily couplings, suppression 373—4, 447-8 
intermediate vector boson 
charged (W) 135, 339, 342-4, 351-3, 379, 386- 
94 
neutral (Z) 344, 351-5, 364-9, 386-94 
isospin 90, 114-15, 129; see also SU(2) 
and chiral symmetry 125, 167 
structure of 
o-term 157-8 
mN amplitude 159 
violation 
by electromagnetism 115, 167 
by strong interactions 166-7 


J) 122; see also charm 
Jacobi identity 89 
jets 216, 322 


K°-K° system 374-85; see also GIM, CP vio- 
lation, charm 
K,—Kg mass difference 372, 377, 380-1 
CP parameters 376-8, 383-5 
KM-matrix, see Kobayashi-Maskawa matrix 
kink 462 
Kinoshita-Lee-Naunberg theorem 317 
kinship hypothesis 448 
Kobayashi-Maskawa mixing matrix; see also 
Cabibbo angle, quark mixings 
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CP violation theory 361, 378, 383-6 

GIM mechanism 380-3 

GUT (kinship hypothesis) 449 

leptonic mixings 361, 410 

in W-decays 388-90 
Kogut-—Susskind pole 492 


ladder diagrams 320 
Landau gauge 196, 263, 505 
Landau singularity 84 
lattice field theory 324-31 
fermion field 328-9 
gauge field 329-35 
link variable 330 
Monte Carlo calculation 325, 335 
plaquette 330, 334 
scalar field 326-8 
lepton flavour number 362, 409; see also lepton 
mixing angles, muon number, u > ey, 
neutrino masses 
lepton nucleon scattering; see also neutrino nu- 
cleon scattering 
Bjorken scaling and quark—parton model 199- 
214 
Altarelli-Parisi equation 312-13 
light-cone singularities in free field theory 
224-6 
QCD calculations, operator product- 
expansion and renormalization group 
equation 298-311; perturbative 316-22 
sum rules, see Adler, Gross—Llewellyn- 
Smith, momentum sum rules 
current algebra, testing, see Adler’s sum rule 
elastic 201-2 
inelastic 84, 134-41, 199-214, 224, 298--311, 
316-22, 369 
polarized eD asymmetry 370-1 
weak neutral current structure, 369-71 
lepton number 355, 414, 418 
leptonic mixing angles 361, 410; see also neutrino 
masses and oscillations 
leptonic weak charged current 136, 204, 337 
leptoquark 442; see also diquark 
Lie algebra 89, 91, 127 
Lie groups 87-8, 433 
light-cone singularities 218-26, 299-300 
Linde—Weinberg bound on Higgs mass 396-8 
linearly rising potential 323, 333, 334 
loop expansion 191-2 
Apt 193-6 
scalar QED 196-8 
as expansion in Planck’s constant 192 
local symmetries, see gauge symmetries 
Lorentz gauge 248, 259 
low energy theorems 
single soft pion 153-5 
two soft pion 156-67 
soft photon 156 


magnetic current 454 
magnetic moment of gauge vector particle 288 
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magnetic monopole 453; see also Dirac 
monopole, grand unification monopole, 
solitons, °t Hooft—Polyakov, topological 
conservation laws 
and charge quantization 454-5, 469 
and angular momentum 455-6, 471 
Majorana mass 412-14; see also charge conjuga- 
tion, neutrino mass, neutrinoless double 
B-decay, neutrino—antineutrino oscillation 
mass; see also fermion, Higgs, pseudoscalar 
meson, neutrino, quark, vector meson 
masses 
eigenstates 356, 371, 410, 430, 447-9; see also 
gauge interaction eigenstates 
effective (running) 81, 449-51 
insertion 382; see also GIM mechanism 
renormalization 30, 34, 449-51 
Goldstone boson 186-7 
self 32-5 
singularities, see collinear singularities 
Meissner effect 240-1, 323 
minimal gauge coupling 231, 232 


mixings 
composite operators under renormalization 
65-6, 309-10 


lepton couplings 361, 410; see also lepton 
flavour number 
neutral vector bosons in SU(2) x U(1) elec- 
troweak theory, see Weinberg angle 
ob, 120-1 
quark couplings 359, 371-3; see also quark 
mass matrix, Cabibbo angle, KM mixing 
momentum sum rule 213-14, 308-10 
most attractive scalar channel (MASC) 408 
rnultiplicatively renormalizable 39, 64—5, 73, 77 
u — ey 362, 409, 420-7 
muon number, 362, 409; see also lepton flavour 
number 


Nambu-—Goldstone boson, see Goldstone boson 
neutrino flavour counting and Z decay 393 
neutrino mass 361, 409-20; see also lepton flavour 
number, neutrino oscillations, neutrinoless 
double B-decay 
cosmological bound 415-16 
Dirac v. Majorana types 412-14 
in GUTs 418 
from radiative correction 417 
in SU(2) x U(1) models 417-19 
neutrino electron scatterings 338, 354, 364-8 
neutrino—nucleon scattering 
Bjorken scaling and quark—parton model 204— 
14 
testing current algebra (Adler sum rule) 137-41 
weak neutral current (Weinberg angle) 369-70 
neutrino oscillations; see also neutrino mass, 
mass v. gauge interaction eigenstates, solar 
neutrino problem 
flavour oscillation 410-12 
neutrino—antineutrino oscillation 420 
oscillation length 411 
Noether theorem 126-8 


532 


non-integrable phase factor 329, 332 
nonexceptional momenta 73 
non-leptonic weak processes 338, 384 
non-perturbative effects 
instanton 476 
lattice field theory 322 
soliton 460 
non-renormalizable interactions 57 
normalization points (mass scale parameter) 
in momentum subtraction schemes 67—70 
in dimensional regularization scheme 77 


O(4) 352, 481; see also SU(2) x SU(2) 
one-particle-irreducible (1PI) 
diagrams 31, 192 
Green’s functions 31, 62, 192 
w-d mixing 120-1 
ideal mixing 121 
operator product expansion 298—308 
factorization of singularities 298 
light-cone expansion 299 
short distance expansion 298 


PCAC 151-8, 161-2 
paramagnetic property of Yang-Mills vacuum 
287-9; see also asymptotic freedom 
parity nonconservation 336, 338 
in atomic physics 370 
partial wave 338, 343-4 
partially conserved axial-vector current, see 
PCAC 
parton; see also parton model, quarks, gluons 
point-like constituents of hadron 203, 206, 
208-9 
sea 209, 211 
spin 1/2 nature of charged partons 207-8 
parton model; see also deep inelastic scattering, 
parton, QCD-parton picture, scaling 
Drell-Yan process 216-18 
ete” annihilation 214-16 
impulse approximation 206 
lepton—nucleon scattering 206-14 
parton distribution functions 207-12 
Paschos—Wolfenstein relation 370 
path integral quantization 11-17 
scalar 17-22 
fermion 23-9 
gauge theories 248-67 
perturbation expansion; see also Feynman rules, 
non-perturbative effects, renormalization 
canonical formalism 6-8 
interaction picture 6 
U-matrix 7 
chiral perturbation 161 
path integral formalism 
Apt 19-22, 498-501 
gauge theories 260 
perturbative QCD 311 
phase transition 325, 335 
n° —> 2y 180-2 
anomaly 181 
colour 182, 291 
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pion nucleon coupling 153 
pion nucleon (nN) amplitude, see Adler’s con- 
sistency condition, Adler—Weisberger sum 
rule, isospin, scattering length, o-term 
polarizations of vector particles 271, 342-3 
polarized eD asymmetry 370-1 
Pontryargin number, see topological winding 
number 
propagator functions 499-509; see also Feynman 
rules 
fermion 23, 221, 263 
ghost 261 
Higgs scalar 264 
scalar 5, 9-10, 31-2, 34-5, 68, 170-1, 219-20 
vector 60-2, 261, 264, 339 
R; gauge 265 
would-be-Goldstone boson 265 
proton decay 442-4 
SU(5) amplitude 443 
model-independent approach 443 
pseudo-Goldstone bosons 246 
in technicolour theories 405, 407, 409 
pseudoscalar meson masses, see current algebra 
o-term, Dashen sum rule, y-mass prob- 
lem, Gell-Mann—Okubo formula, Gold- 
stone boson, quark masses 
pseudo-particles 476; see also instantons 
pure gauge 480 


QCD 124, 279; see also gluons, quarks 
effective coupling constant 295-6, 312, 438; see 
also asymptotic freedom, confinement 
global symmetries 125, 168, 294; see also chiral 
symmetry, strong CP problem 
Lagrangian 291 
operator product expansion and renormaliza- 
tion group calculations 291-311; see also 
e*e annihilation, lepton nucleon scatter- 
ing, Byjorken scaling 
parton picture 208-9, 214, 312; see also 
Altarelli—Parisi equation 
perturbative calculations 311-22; see also 
lepton nucleon scattering 
scale parameter 296, 335, 403, 442 
vacuum 323, 493; see also vacuum in Yang- 
Mills theories 
QED 
gauge theory, as prototype 229 
quantization, difficulty of 248 
renormalization group 67 
B-function 283-4 
charge screening 280 
renormalization programme 31 
Ward identities 
axial-vector (anomaly) 173-9 
vector 174, 267-8, 272 
QFD 336—427 
quantization 
canonical formalism 3 
scalar 4—11, 170, 219-20 
fermion 22-3, 221 
lattice field theory 325 


Index 


path integral formalism 3, 498-503 
scalar 11-22 
fermion 23-9 
gauge theories 248-60 
lattice field theories 325, 327-8 
theories with magnetic monopole 454—60 
quantum chromodynamics, see QCD 
quantum electrodynamics, see QED 
quantum flavour dynamics, see QFD 
quark 
currents; see also baryon number, colour, GIM 
mechanism 
electromagnetic 134-5, 210, 346; fractional 
charge 117-18, 123, 346-8, 431, 474 
weak; charged 136, 210, 359, 371-4; neutral 
353, 362, 364-5, 371 
masses; see also chiral symmetry 
chiral symmetry breaking 125, 152, 160-8 
constituent v. current masses 168 
SU(5) relations to lepton masses 449-51 
sizes m,, Ma, m, 167-8; m., m, 122 
strong CP problem 493 
model, simple; see also Gell-Mann—Okubo 
formulae, od mixing, Zweig rule 
and chiral symmetry 131-4; see also axial 
U(1) problem 
eightfold way SU(3) flavour symmetry 115- 
21 
paradoxes 123 
quantum numbers 117-18 
quarkonium 122-3; see also J/W, Y 


R: gauge 
Abelian 263-7 
non-Abelian case 267 
standard SU(2) x U(1) theory 420, 506-12 
radiation gauge 248 
rank 90, 433-4 
regularization schemes 33, 45 
covariant cutoff (conventional momentum 
cutoff) 45-52 
momentum subtractions 67-9 
dimensional regularization 52-6, 78, 267, 278 
minimal subtraction 78-9 
renormalizable gauge 247, 264; see also R; gauge 
renormalizability 45, 56, 60 
multiplicative 39, 64 
renormalizable interaction 57 
non- 57, 61 
super- 57 
soft symmetry breaking 187-8 
vector field 61-2, 263-4, 340 
renormalization; see also anomaly, perturbation 
expansion, renormalization group, 
renormalizability 
constants 40, 267-8 
coupling constant 37, 285 
vertex Z, 37 
Z, 285 
wavefunction (field) Z, 35 
Z, 285 
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composite operator 62-6 
mass 34 
Goldstone boson 186-7 
Ad* theory 31-45 
physical picture 30 
programme 
BPH 39-45 
conventional 31-9 
SSB theories 182-8, 189-98 
Yang-Mills theory 284-6 
renormalization group; see also anomalous di- 
mension, effective (running) coupling 
calculations 
SU(5) GUT 
coupling constant unification 439 
lepton—quark mass relations 449-51 
proton decay amplitude 444 
QCD corrections 
ete” total cross-section 297-8 
IN scaling 303-4 
equations 
B functions 71-2, 


QCD (Yang-Mills) 286-7. 289, 295-6 
Yukawa 282-3 
SU(5) 439-40 
Callan—Symanzik 70-3 
other forms 75, 304, 307 
asymptotic solutions 74-7, 81, 304, 307 
~ functions 71-2, 77, 80, 83--4 
QCD 307, 311 
moments of Altarelli~Parisi functions 314 
physical basis 67~70 
summation of leading radiative corrections 82, 
316 
representation 87 
adjoint 89, 113, 432, 434 
basis states of 87. 90, 92 
complex 407-8, 434: see also real representa- 
tion 
conjugate 89, 102-3. 109, 118. 429 
dimension of 87, 92, 101. 106-7 
fundamental (defining) 87. 102. 109. 116, 429 
graphic representation of 95-6. 99-102 
irreducible 87, 92. 104 
product 93-6, 108-9 
real 89, 96-7, 434 
vector 103, 109, 434 
p-parameter 352, 364, 394 
experimental value 387, 419 
for weak isospin of Higgs 
T= 1/2 352 
T=1 419 
general T 394 
right-handed neutrino 418-20 
rotational group 89, 91-2 


SO(2), see U(1) 

SO(3), see SU(2) 

SO(4), see SU(2) x SU(2) 

SSB, see spontaneous symmetry breaking 
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SU(2) 90-7; see also SO(3) 
gauge theories 231 
Georgi-Glashow model 341, 466 
Higgs phenomenon 243 
quantization (covariant gauges) 259-63 
instanton 479 
Slavnov-Taylor identity and BRS trans- 
formation 273 
isospin flavour symmetry 114-15 
Noether theorem, 129 
SU(2) x U(1) 87, 340-1 
SU(2) x SU(2); see also O(4), chiral symmetry 
o-model, SSB and Goldstone theorem 149 
breaking to SU(2) and p = 1 352, 404 
SU(3) 97-102 
colour group 123-4 
QCD Lagrangian 291 
decouplet 116-20 
eightfold way, flavour symmetry 115-23 
octet 116-17, 119-21 
quarks, free field theory of 131 
symmetry breaking 119, 165 
weak current transformation properties 136 
SU(3) x SU(3) 
charge algebra 130 
current algebra 132 
SU(4) 102 
flavour symmetry 122 
lepton—quark mass relations in SU(5) model 
449 
tumbling 408-9 
SU(5) 90 
GUT 428-52 
tumbling 407-9 
SU(n) 90, 102-13 
scaling 199, 203-6, 279, 308, 312 
ete” 214-16, 223-4 
Drell-Yan process 216-18 
lepton—nucleon (Bjorken) 203-14, 218-19, 
224-6, 308 
functions 204, 205 
variables, x, y 203 
light-cone singularities 218-19 
violations 308, 310-11, 312 
Schrodinger picture 6, 11 
Schwinger terms 133, 157 
self-dual field 481 
self-energy 32, 34-5, 49-50, 68-9, 78, 449-50 
o-model 149-51, 403 
o-term 157, 160-1 
nN 157-60, 161, 165-6 
pseudoscalar meson masses 162, 164, 166-7 
simple group 87, 235, 430, 433, 469, 472 
semi-simple group 87, 346, 433 
singularities in product of 
field 219, 298-9 
currents 221—3, 300, 305 
Slavnov—Taylor identities 267-78 
soft photon theorems 156 
soft-pion theorems 
one soft pion 153-5 
two soft pions 156-60, 162 
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soft symmetry breaking 187 
and renormalizability 187-8 
spontaneous symmetry breaking 188 
solar neutrino puzzle 410 
solitons 460-9 
Ag* in 2 dimensions 460-4 
Agt in 4 dimensions 464-5 
gauge theories with SSB (°t Hooft—Polyakov 
monopole) 466-9 
spontaneous symmetry breaking (SSB) 125, 141- 
51, 240; see also chiral symmetries, 
Goldstone theorem, Higgs phenomenon, 
VEV 
fermion masses 151, 357, 407, 449 
pattern of 245-6 
radiatively induced 195-8, 396-8 
and renormalization 182-8, 264 
and solitons 464 
standard electroweak theory, see Weinberg— 
Salam theory 
string, Dirac 456-60, 470 
string model of hadrons 323 
string tension 323, 333, 335 
strong coupling expansion 325, 333-5 
structure function 138, 201, 204, 211-14, 305; see 
also form factor, scaling, parton distri- 
bution function 
flavour singlet combination 305, 308-10, 314 
flavour non-singlet combination 305, 310-12, 
314 
helicity 205, 208 
moments 212-14, 300-3, 308, 311, 313-14 
subgroup 86 
invariant subgroup 87 
subtraction 34; see also renormalization, regular- 
ization scheme 
point (normalization condition) 68-70, 78, 81, 
82, 281 
schemes 
momentum (conventional) 67-70; 
intermediate renormalization 68-9 
on-shell renormalization 69 
minimal (mass-independent) 78-80 
Sudakov variables 318-19 
summation of leading logarithm 82, 315-16, 322 
supersymmetry 290, 437 
symmetry; see also baryon number, CP, chiral, 
currents, global symmetries, gauge, lepton 
numbers, SSB, Ward identities 
and group theorem 86 
and mass degeneracy 113-14, 125-8, 142 
and renormalization 169 
symmetry factor 9, 11, 32, 187, 193, 269, 503-4; 
see also Feynman rules 


t-quark 122, 356 
tadpole diagrams 182, 183 
and shift in VEV 182-6 
and massless Goldstone bosons 186-7 
tau-leptons 356 
technicolour 401-9; see also extended technicolour 
scale parameter ^c 405 
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tensor representation of SU(n) 102-13, 428 
0-vacuum 486-7, 490-3; see also strong CP 
problem, axial U(1) problem 
°t Hooft-Feynman gauge 263, 268, 277, 379, 426 
t Hooft-Polyakov monopole 453, 466-75; see 
also magnetic monopole, solitons 
charge quantization and simple group 469, 472 
coupling of spatial and internal symmetries 471 
as Dirac monopole 469-71 
SO(3) 466-72 
SU(5) 472-5 
chromomagnetic charges 474 
catalyses proton decay 474-5 
(3, 3*) © (3*, 3) model 162; see also chiral sym- 
metry breaking 
top, see t-quark 
topological conservation law 463-4 
topological winding number 477-9, 480, 483-7 
tumbling 407-9 
twist 299-300, 305, 309 


U(1) 86; see also SO(2) 
axial U(1) problem 294, 487 
gauge symmetry 229-31 
Goldstone theorem 147-9 
at one-loop level 186 
Higgs phenomenon 241-3 
Noether theorem 128-9 
vector Ward identity, radiative correction 170- 
3 
unitarity 
Abelian gauge theory 263-4, 266 
and renormalizability in weak interaction 
theory 338, 342-5 
and Ward identities 268-73, 278 
unitary gauge 242, 244, 263, 349-51, 426 
unitary group 86 
universality 
Cabibbo 365, 373 
gauge theory 231, 235 


Y 122; see also b-quark 
VEV, see vacuum expection value 
V-A theory 135, 204, 336-8, 353, 364 
vacuum 18 
QCD 323, 493 
degenerate vacua in theories with SSB 143, 464 
multiple vacua with different winding number 
482-6; see also -vacuum 
vacuum expectation value 
condition for SSB 143 
effective potential formalism 189 
one-loop perturbative shift 182-6 
sizes in electroweak unification 353, 402, 436 
in GUT 436 
vacuum polarization 297, 383, 404 
vacuum saturation approximation 380 
vacuum-to-vacuum amplitude 18, 29, 189, 254, 
259, 275, 484, 487; see also generating 
functional 
vector—axial-vector ambiguity in neutrino re- 
actions 367 
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vector meson masses 
gauge fields 231, 235 
hadrons, see Gell-Mann—Okubo formula, w—@ 
mixing 
Higgs phenomena 240-7 
and renormalizability 60-2, 264, 338-9 
W and Z 351-2, 355, 387-8 
vertex renormalization constant 37, 285 


W; see also Z 
in pre-gauge-theory models 338-9, 342-4 
in standard SU(2) x U(1) theory 351-3, 379, 
384-94 
decays, leptonic branching ratio 389 
Higgs 390 
width 389 
mass 387-8 
production in Drell-Yan process 394 
WKB approximation and imaginary time 484-6 
Ward identities 156, 169, 267-8 
vector current in Af* 170-3 
axial-vector current in QED 173-9 
and unitarity 267, 268-73 
non-Abelian gauge theories (Slavnov—Taylor 
identities) 272-8 
weak 
axial nucleon coupling, see ga 
charged currents, see currents 
eigenstate, see gauge interaction eigenstate 
hypercharge 346-8, 431, 439 
mixing angles, see Cabibbo angle, KM mixings, 
lepton mixings, Weinberg angle 
neutral currents, see currents 
radiative correction to QCD Lagrangian 294-5 
weak interaction theories 
Fermi 336 
Georgi-Glashow SQ(3) 341-2 
IVB 338-9 
standard SU(2) x U(1), see Weinberg—Salam 
theory 
V-A 135, 204, 336-8, 353, 354, 364 
Weinberg angle 
definition 352 
W-Z mass relation 352, 364, 387 
value from neutral current processes 354, 365 
pure leptonic 365-9 
VN scatterings 369-70 
SU(5) prediction 439-41 
Weinberg—Salam theory; see also CP symmetry, 
GIM mechanism, generation, Higgs 
phenomenon, Kobayashi-Maskawa mix- 
ing, weak neutral current, Weinberg 
angle 
choice of SU(2) x U(1) group 340-5 
fermions 
anomaly cancellation 347-8 
neutral current couplings 354, 365, 391 
quantum number assignment 345-6 
global symmetries 355-63 
baryon and lepton numbers 355 
CP violation 359-61 
muon number 362 
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Weinberg—Salam theory (cont.) 
Higgs particle 348-50, 394-401 
p = 1 and isodoublet structure 352, 394 
massless neutrinos 361, 417-18 
SSB 348-50 
W and Z gauge bosons 351-4, 386-94 
Weinberg’s theorem 
on asymptotic behaviour of Green’s functions 
73-4 
on convergence of Feynman integrals 42-3 
Wick rotation 47 
Wick theorem 8-10, 21, 62, 221 
Wilson coefficient 298, 300-2, 303-4 
Wilson loop 331-4 
winding number, see topological winding number 
would-be-Goldstone boson 243, 244, 247, 263-7, 
351, 420-6 


Yang-Mills fields, see non-Abelian gauge fields 
Young tableau 104-9, 117-19; see also tensor 
representation of SU(n) 
conjugate representation 109 
dimension of irreducible representation 106-8 
distance to the first box 107 
fundamental theorem 106 
hook length 107 
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lattice permutation 108-9 

product representation 108-9 

standard tableau 105-6 

Yukawa couplings 349-50, 357, 395 

B-function 282-3, 290 

fermion masses in SSB theories 
o-model 151 
Weinberg—Salam theory 349-50, 357, 395 
extended technicolour theories 406 
neutrino masses 418-19 
SU(5) model 448-51 

global symmetry properties 
CP 386 
B and L in electroweak theory 355 
B — L in minimal SU(5) 451-2 


Z 351-4, 364; see also W 
decays, branching ratios 392 
counting neutrino flavours 393 
neutrino pair final states 391 
Higgs 392, 399 
mass 387-8; see also p-parameter 
production in Drell-Yan processes 394 
ZWW couplings 393 
Zweig rule 121-2, 165 


